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PREFACE 


This book is founded on the lecture course given in the 
University of St Andrews to students reading for an 
Ordinary Degree in Mathematics and to first -year Honours 
students. It contains a short and elementary account of 
algebraic equations, both from the theoretical and the 
practical side, together with the algebra of polynomials 
and rational fractions. It includes classes of number, 
partitions, identities, the G.C.M. process, partial fractions 
and recurring series, but it omits continued fractions and 
indeterminate equations. Cubic and biquadratic equations 
are discussed, together with more general types, elimina- 
tion, and symmetric functions, but the theory of invariants 
and of groups is left untouched. 

An elementary knowledge of algebra is presupposed, 
particularly of long division, quadratic equations and the 
binomial theorem, together with elementary determinants, 
coordinate geometry and differential calculus. The more 
advanced determinantal and matrical theory of p. 48 and 
p. 141 may be omitted on a first reading. 

The book has been written with the historical develop- 
ment of algebra constantly in mind ; and many of the 
topics have been selected for their importance, not only 
as part of a general mathematical education, but also as 
preliminaries to the study of all higher algebra. 

I acknowledge with gratitude the help which I have 
derived from the well-known treatises on algebra and 
equations written in England, Ireland, America and 
Germany : more particularly from the works of Todhunter 
and of Professor P. B. Fischer. I also thank the editors of 
the present series, my colleagues, and the printers for their 
helpful co-operation. H w . turnbuLL. 

vil 



PREFACE TO THE SECOND EDITION 

The text of the first edition has been revised, and I am 
grateful to friends who have pointed out to me various 
errors, which have been corrected in the present edition. 
A new chapter is added on general methods of root 
expansion, which links the early work of Newton with 
recent discoveries of the Edinburgh School of Algebraists, 
and affords an introduction to the study of bialternants. 

H. W. T. 
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CHAPTER I 


INTEGERS, MORE GENERAL TYPES OF NUMBER, 
POLYNOMIALS 

1. Integers and Partitions. The concepts of positive 
integers and of their partitions are fundamental in algebra. 
We may imagine the natural numbers 1, 2, 3, ... to be 
arranged in a sequence, as here, according to ascending 
order of magnitude. The n th term of the sequence is the 
positive integer n. JFor every integer n there is an integer 
w+1 immediately following ; so that the complete sequence 
of natural numbers has no last term. We may express 
this by saying that no positive integer is infinite, or in 
symbols by writing 0<w<oo. 

If r, s, . . . , z are k positive integers such that 

n =*= r+s + ... + z . • (i) 

then k must be some number between 1 and n inclusive, 
so that If n is fixed we may regard (1) as an 

equation to determine the unknowns r, s, . . . , z. Usually 
there are several possible solutions and the equation is 
said to be indeterminate ; but if k = 1 there is one 
solution r — n, and if k = n there is again one solution, 
r — s — ... — z — 1. The set {r, 8 } . . . , z } is called a 
partition of w, and it is customary to arrange the parts, 
or terms, r, s, ..., z in ascending, or else in descending, 
order. For example {112} is a partition of 4 into 1+1+2, 
and {346} of 13 into 3+4+6. Derangement of the terms 
is immaterial, so that we should reckon 1+2 + 1, 2+1 + 1 
or 1+1+2 to be the same partition. It is easy to verify 
that there are five distinct partitions when n = 4 : 

{1 1 1 1}, {1 1 2}, {1 3}, {2 2}, {4}. . (2) 

A 



2 


TYPES OF NUMBER 


These are sometimes represented graphically by 
X 

X X 

XX X XX 

X, XX, XXX, XX, XXX X, 


( 3 ) 


where rows of crosses take the place of parts or terms in 
the partition. In general the top row has r crosses, the 
second, s, ...» and the last, z . We may naturally enquire 
how many distinct partitions of n exist : and if we denote 
this number by pin), we can verify that p(l) = 1, p( 2) = 2, 
p(3) = 3, pi 4) = 5, p(5) = 7, but to find an expression 
for the general value pin) proves to be a singularly difficult 
undertaking, which has taxed the skill of the greatest 
matl i ernaticians. 

The five partitions of n = 4, arranged as above, are 
said to be in lexical order. This manner of arrange- 
ment becomes obvious when the integers 1, 2, 3, 
are replaced by the respective letters a , b , c, and 
each partition is regarded as a “ word,” for then the 
words fall into alphabetical, or lexical, order : aaaa , aab , 
ac , bb, d. Such an order is possible and unique for each 
value of n. 

Conjugate Partitions. The partitions {112} and {13} 
are said to be conjugate. Conjugacy is a mutual relation 
which becomes intuitive when the graphs of crosses are 
examined : the columns of one graph, read from left to 
right, have the same numbers of entries as the rows of 
the conjugate graph read from below to above. Conjugate 
graphs are therefore reflexions of each other in a line 
inclined at 45° to the rows or columns. Every partition 
possesses a conjugate or else is self -conjugate : {1111} and 
{4} are conjugate, {22} is self- conjugate. 

Example. — Obtain the partitions of n = 5, n = 6, and 
arrange them in conjugate pairs, and self-conjugate singles. 


2. Rational and Irrational Numbers. In a more 
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general view, integers are either positive, negative or 
zero, and can be arranged in ascending order . . . —3, —2, 
— 1 , 0, 1 , 2, 3, .... If n is any such integer then 
— oo <?i <oo. If p is any integer and q is any positive 
integer, the number p-i-q (or p/q) is a rational number, a 
concept which leads at once to the theory of fractions, 
proper and improper, of factors, multiples, prime numbers 
and pairs of numbers prime to one another (that is, which 
have no common factor other than unity), with all 
of which terms it will be supposed that the reader 
is familiar. By expressing a rational number in decimal 
form we may find that it either terminates or recurs : 
for example 241/100 = 2*41 which terminates, while 
12/11 = 1*090909... = 1*09 recurs. Conversely, every 
terminating or recurring decimal is reducible to a rational 
number of the form p/g, and when p and q are prime to 
each other the reduction is unique. But we may also 
contemplate decimals which neither terminate nor recur, 
such as 0*1234567891011... (which is composed of the 
digits of the positive integers in natural order), or 
1*4142... = \/2. These represent irrational numbers. 
We include under the name real both rational and irrational 
numbers. As it is impossible to make a complete list 
in ascending order of all the consecutive rationals (much 
less of irrationals) between two integers, it is natural to 
invoke the help of geometry and to represent these 
numbers by points on a straight line. Each real number x 
is then represented by a point N upon an unlimited axis 
X'OX, and x is called the coordinate of N referred to an 
origin 0 : and the number x is the distance ON in terms 
of a given unit distance. 

3. Polynomials, Equations, Complex Numbers. 

We may consider numbers from another point of view 
as the roots of equations. If n is a positive integer the 
expression 

f(x)=a ( p: n +a 1 x n - 1 -\-a^: n - i +...+a n _ 1 x+a n . (1) 
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is called a polynomial of order or degree n in the variable x 
(briefly, an n-ic in x ), whose coefficients a 0 , esq, a n are 
constants, of which a 0 is non-zero (a^O). The relation 
f(x) = 0 is then called an equation of order or degree n. 
It is a major problem of mathematics to determine the 
roots of this equation, that is, to find values of x for which 
the equation is true. The attempt to solve the equation — 
to find one or more such roots — has led to the concept of 
a new type of number, distinguishable from the real, and 
called a complex number. 

Let us write out the equation systematically for low 
values of n , using a, 6, c for the coefficients : 

ax-\-b = 0, a#0, 

ax 2 +bx+c = 0, , 2 v 

ax z -\-bx 2 -\-cx-\-d = 0, ’ * • w 

ax A +bx 3 +cx 2j rdx-\-e — 0, 


and so on. These are called the linear, quadratic, cubic, 
quartic (or biquadratic), ..., equation respectively. First 
let all the coefficients be integers : then the linear equation 
is soluble in rationals, namely x = —bja. Conversely 
every rational number is the root of a linear equation 
with integer coefficients ; for example 2*41 is the root of 
100# =241. In particular if a — 1 the root of such an 
equation is an integer. Rational roots may also exist 
for higher equations, as in 2a; 2 — 3a; +1 = 0, of which 
x = 1 is a root, or in a; 3 — 3a; — 18 = 0, of which x — 3 is 
a root ; but in general the roots of non-linear equations 
are irrational. Even with integer coefficients a, 6, c the 
quadratic equation ax 2 -\-bx-\-c = 0 cannot always be 
solved unless complex numbers are introduced. For the 
elementary method gives the formal solution 


-b±W(b 2 -4ac) 

— — •- «*>■ 


(3) 


which is only possible in terms of real numbers when 
6 2 — 4ac^0. If 6 2 — 4ac is a perfect square p 2 , >0, there 
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are two solutions x = — (b±p)/2a which are both rational. 
If b 2 — 4ac is positive but not a perfect square there are 
two irrational solutions. If 6 2 — 4ac = 0 there is one 
solution ; it is rational, and the quadratic is said to have 
a repeated root. But if 6 2 — 4ac is negative there is no 
real solution : and at one time in the history of mathematics 
such an equation was rejected as impossible. However, 
the more enterprising of mathematical pioneers — Cardan, 
Napier, Wallis, Leibniz and Gauss — boldly went forward, 
asserting the existence of a new type of number, which 
Napier called the ghost of a real number but which 
nowadays is called the complex number. The formula (3) 
gives two complex values of x whenever b 2 — 4ac<0, that 
is when 4ac— 6 2 is positive. It is usual to reduce x to 
the form 

X = a±ip, . . . (4) 

where a = — 6/2a and /? = \/ (iac- lr) , so that both 
a and are real, but i is not real, although it satisfies 
the quadratic equation with real coefficients, 

i 2 +l = 0. . . . . (5) 

Here we have made two important assumptions : 

(i) that equation (5) has a root i (which can formally 

be written yj — 1), 

(ii) that this root i is a number which combines with 

the real numbers according to the ordinary laws 
of algebra. 

Example. — The equation 2x 2 -fa;+3 = 0 has complex roots 

— --fi ; so also has 2x 2 + 6x-\-5 = 0, the roots of which 

4 4 

are — 

4. Geometrical Treatment of Complex Numbers. 

There are several ways of justifying the above assumptions 
regarding complex numbers, but perhaps the most 
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attractive, and on first consideration the most convincing 
method, is that of Gauss (1797), who represented a complex 
number a±if} by a point (a, /}) referred to rectangular 
Cartesian axes Ox, 0 y in the familiar way. 



To each point P of a plane there correspond a number 
pair (a, /?) and a complex number a There is thus 
a one-to-one correspondence between the finite points of 
the plane and the finite complex numbers. The plane is 
called the complex number plane, the Gauss plane or, some- 
times, the Argand diagram. 

In polar coordinates {r, 9} we have a = r cos 9 = ON, 
= r sin 0 = NP and therefore 

a-\~ij 8 = r( cos sin 9). . . (1) 

We may always avoid any ambiguity by assuming that 

r ^ 0, — - 77 < 9 ^ 77. . . . (2) 

With these provisos, which attach to each complex number 
one value of r and one of 9, we call r the modulus and 9 
the amplitude of the number z = a+i/? represented by the 
point P. The notation |z| is used for the modulus. Thus 

1*1 = |“+^l = r= . . (3) 

Also tan 9 = fi/a or am z = arc tan fi/a — tan” 1 /?/a. . (4) 

Real numbers are represented by points on the axis 
of x or by number pairs of the type (a, 0), where a can be 
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positive, negative or zero. Pure imaginary numbers 
are represented by points on the axis of y, except the 
origin (0, 0), or by number pairs of the type (0, /?), where 
/3>0 or /?<0. For this reason the line y = 0 is sometimes 
called the real axis and the line x = 0 the imaginary axis . 



PKP) 


A 

(!>°) 


The symbol i is to be regarded and defined as an 
operation upon real numbers which changes the real pair 
(a, /3) on which it operates to ( — /?, a), or geometrically, 
the operator i is defined as that which turns the line OP 
in a positive sense to OQ through a right angle. In the 
figure of a circle PQRS, ignore the points ABCD for the 
moment, and suppose the radius r to take any positive value. 

From this definition everything follows. Repeat the 
operation and OP is turned to OR through two right 
angles : three operations Hi turn OP through three, and 
four operations iiii through four right angles. If z — a+ij3 
then the four expressions iz, iiz = i 2 z , iiiz = i 3 z , iiiiz 
= i*z denote the four complex numbers corresponding to 
Q, R, S, P, the vertices of a square centred at 0, for all 
non-zero values of z. We note that i(ii) = (ii)i = in. 

In particular if z == 1 and P is at A, then i M i 2 , i z are 
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written short for i 1 , i 2 1 , i 3 l and correspond to the points 
B, C, D respectively. But C has coordinates (—1, 0) ; 
hence i 2 = — 1. This links the present definition of i 
with the previous assumption, that i denoted the root of 
the equation i 2 + l — - 0. 

Next we can show that ifi, pi mean the same number. 
For both correspond to the point (0, P) — either by taking 
a length OP = P along Ox and turning it through a right 
angle, or by taking a unit length along Ox, turning it 
through a right angle to get i and then measuring off p 
such units along O y. 

Addition is defined by the identity 

a+ifi+y+i8 = (a+y)+i(P+$). . (5) 

If P is (a, /?) and R is (y, 8) and OPQR is a parallelogram, 
then Q denotes the complex number so obtained by 
addition. Geometrically the points (0, 0), (a, /?), (y, 8), 
(a+y, /3+S) form a parallelogram. 




Subtraction follows at once as the inverse of addition. 
If we write, for short, OP+OR = OQ, then OQ— OP = OR. 
Or, again, replace y and 8 by — y and —8 in the above, 
and we have 


a+ij3— y— iS = (a— y)+i{p— 8). 


(6) 
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Multiplication is defined by the identity 

(a+ifi)(y+i8) = ay— /?8+i(a8+0y) . (7) 

which is readily formed by the ordinary rules of algebra, 
and the relation i 2 — —1, but is geometrically equivalent 
to the following rule : 

Let the corresponding polar coordinates be {r, 9} and 
{ 5 , <f>}. Then 

{r,9}{s,<t>} = {rs,9+<f>} > . . . (8) 

that is, multiply the moduli and add the amplitudes to get 
the modulus and the amplitude of the product. If d-\-<f>>Tr 
or <— 7 r, the amplitude is of course the equivalent angle 
between ±tt. 

To prove that this rule is equivalent to the identity (7) 
we take, as is usual, 

a — r cos 6, [} — r sin 9, y — s cos </>, 8 = s sin <f> (9) 

and thus we can write (a, ft) = {r, 9} = r (cos 9+i sin 9) 
to denote the same complex number. Hence 

(a+ifi)(y+iS) 

=r( cos 9+i sin #)s(cos sin <f>) 

= rs ( cos 9 cos <f >— sin 9 sin cf>+i ( cos 9 sin <£-f- sin 9 cos <£)) 

= r$(cos ( #+<£)+* sin (#+</> )), 

or (a, jS)(y, 8) = {rs, #+<£}> which is what we wished to 
prove. The rule given above for forming the product of 
two complex numbers is in fact a geometrical statement 
of Demoivre’s theorem. 

Division is defined as the converse of multiplication, 
namely, 

{r, 0} = {r/«, s^O, . . (10) 

for, by the earlier rule, 

{r/s, 0 — <f>} {«, <f>} = |js, 6—<{>+<f > |= (r, 6}. 

Manifestly (r, 6} { 5 , <f>} = {«, <f>} {r, d}. 
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It is well to notice that this rule of multiplication incor- 
porates the earlier geometrical definition of i. Instances of 
(7) are 

{2, 0} {1, Jtt} = {2, £*}, 

{3, £tt} {2, -in} = {6, 0}, {], \n} {1, in} = {1, 7 r}, 
that is to say, 2 xi — i2, 3 ix(—2i) — 6, ixi — — 1. 

5. Complex Numbers as Hoots of Quadratic 
Equations. The number a+t*/3 is a general complex 
number, separated into real and imaginary parts a, ij 8 
respectively. Sometimes /3 is referred to, rather loosely, 
as the imaginary part of the complex number. We 
note that every such complex number is a root of 
a quadratic all of whose coefficients are real : for if 
x = a+i/J then (x—a) 1 2 — (i/3) 2 = i 2 /3 2 = — /3 2 . Hence 
x 2 — 2cur+a 2 +/3 2 = 0, which is a real quadratic equation. 
Similarly, a -d/3 is a root. We call these two roots con- 
jugate complex roots, and we say that form a pair of 

conjugate complex numbers. 

The sum of two conjugate complex numbers is always 
real, the difference (whenever /3y^0) is a pure imaginary. 

We have introduced complex numbers by going no 
further than quadratic equations whose coefficients are 
integers. The questions naturally arise, what happens if 
we allow the coefficients themselves to be fractional, 
irrational or even complex, and again if we consider 
equations of higher degree ? It is one of the dramatic 
surprises of mathematics to find that no further generaliza- 
tion is necessary, in that any such equation can be 
completely solved in terms of complex or simpler numbers. 
This fact, which is called the fundamental theorem of 
algebra, was first proved by Gauss (c/. p. 56). At present 
we shall assume it, and shall illustrate its scope by two 
examples. 

1, To solve the complex linear equation ax-\- b =0, 

where a and b are complex. Let a — a +i£, <2 = a— ip ; then 
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calculate x — — bajad . For instance if (2-f3t)a;-f-4 — 5i = 0, 
we have 

5i-4 (5i — 4)(2 — 3i) 7 + 22 i 7 ( 22. 

* ~ 2+3 i ~ (2+3i)(2 — 3i) “ 4-9 i 2 _ l3 + l3 1 ' 

The point of the process is to render the denominator real 
and thus to separate the real and imaginary parts of x. We 
realize the denominator of a complex fraction by a method 
analogous to that of rationalizing the denominator of a surd. 

2. Every polynomial whose coefficients are complox can 
bo regarded as a factor of a polynomial whose coefficients are 
real. For example: (2 -\-i)x 2 — (3 -\-2i)x-\-i. Arrange this as 
(2x 2 — 3x) -}-(x 2 — 2 x-{-\)i and multiply it by the conjugate 
polynomial (2x 2 -3x)—(x 2 —2x-\-\ )i, obtained by changing 
i to — i. The result is 

(2x 2 — 3x) 2 -\-(x 2 — 2x-4-l) 2 > 
or 5x* — 16:r 3 + 15a: 2 — 4x + 1, which is real. 

6. Algebraic Numbers and Integers. Within the 
scope of real and complex numbers further demarcations 
can be made. If a 0 =fi 0 and each coefficient of the poly- 
nomial in 3 (1) is an integer, then each root, real or complex, 
of the equation f(x) == 0 is called an algebraic number : 
conversely any such number must be a root of such an 
equation for a finite value of n. In particular if a 0 = 1, 
each root is called an algebraic integer : for instance i is 
an algebraic integer. So also is \/3, for it is a root of 
x 2 —3 — 0 ; and again the complex surd co = (— l-)-i\/3)/2 
is an algebraic integer, for it is a root of the equation 
to 2 -j-co + l = 0. The numbers are sometimes called 
the imaginary units , for they have properties resembling 
those of the real units +1. In fact if x = or +1 
then any integral power of x is equal to one or other of 
these four units. 

Algebraic numbers may be rational or irrational, but 
it does not follow that all irrationals are algebraic : such 
as are not so are called transcendental, well-known examples 
of which are tt = 3*1415926535... and e = 2*7182818284.... 
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Neither of these is the root of an equation of finite degree, 
whose coefficients are integers. The first attempt to prove 
the transcendence of rr and e was made in 1667 by James 
Gregory, but the first strict proof was given by Lindemann 
as recently as 1882. 

Note. — The arithmetical foundations of the theory of 
rational and irrational numbers is due to Dodekind and Cantor. 
For a full treatment of real and complex numbers the reader 
is referred to G. H. Hardy, Pure Mathematics (Cambridge, 
1938), pp. 1-33. 

7. Rational Functions. By an obvious extension of 
ideas we classify functions of a variable x in much the 
same way as numbers, into integral, rational, irrational, 
algebraic and transcendental functions. The result of 
combining x with itself and with constant numbers by 
addition, subtraction and multiplication in any order, 
with or without repetitions, but only with a finite number 
of such elementary operations, is called a rational integral 
function . It is in fact a polynomial , which can always 
be arranged in descending powers of x. If division is 
also admitted a finite number of times the function is 
called a rational, but it may be fractional, function. 
Practice with these processes will have convinced the 
reader that any such function can be reduced to one or 
other of the forms 

f(x)l<f>(x), f(x), c, 

where both f(x) and </>(#) are polynomials in x , and c is a 
constant, which may possibly be zero. 

Examples. 

(i) — 0 (ii) X - p (hi) (x+a)(x-a)-{x-\-b)(x-b). 

\+ 2 - x ~ l 
x 

(i) Here f(x) = x 2 -{-Zx, (f>(x) = ic + 2 ; (ii) f(x) = a? + l ; 

(iii) c = b 2 — a 2 . 
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Examples of polynomials or rational integral functions : 

2x-|-3, x 8 , ax^+bx+c, £x a — 3, (px—q)* -\-rx t 2x n 4-3x ,) — 1, 
where n and p are po utive integers. 

Rational fractional functions : 

1 a-\-bx 2 — 3x4-4x 2 1— x n 1 2 

~ — ! — , ' , , , , x- 8 — 2x~ 4 4-x. 

x c4-dx x a 1— x 2n x — 1 x — 5 

Irrational functions : y/x , ^(x 2 — 2), (1 -f ^x)/(l — x). 

Transcendental functions : e®, sin x, log x, sin _1 x. 

The above examples are typical but not exhaustive. 
The functions also maybe either algebraic or transcendental, 
so that we include under algebraic both rational and 
irrational cases. The feature of an algebraic function 
y = is that it is always possible to express the relation 
between x and y rationally as a polynomial involving 
both variables, say f(x, y) = 0. For example, 

if y — (1+ %x)l(l— x), then (y(l— *)— l) 3 — x — 0. 

This is called a sextic equation in x and y , since the term 
of the highest degree is x 3 y 3 . It is impossible to express 
a transcendental function by such an equation of finite 
degree. 

Further examples. Discuss the nature of the following 
functions: \/(^ a 4-2x4-l), VO^ + l)* (1— sin a x/(l — cos 2 x), 
(x 2 4 -2x)/x. [Rational, irrational, polynomial, polynomial.] 

8. Tabular and Graphical Representation of a 
Polynomial. If f(x) is a given real polynomial of degree 
n, and if y = f(x), then it is always possible to represent 
the function f(x) both graphically and by means of a 
table. We take particular values x 1 , x 2 , x 8 , ... of x and 
calculate the corresponding values y l9 y 2 , y 3 , ... of y . 
Since f(x) is rational, integral and real, we obtain one real 
value of y for each x. Had f(x) been a rational fraction 
this would not be the case, for there would be no value 
of y corresponding to such exceptional values of x as cause 
a denominator to vanish. The results ca: 
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tabulated either by rows or by columns in the familiar 
way, and when this is done the table gives a representation 
of the function. While it is a suggestive and useful 
representation, yet it is only approximate, for the values 
of y are usually decimalized and accurate only to a limited 
number of decimal places, and in any case a complete 
set of values in any interval whatever is out of the question. 
It is useful to arrange the values of x from left to right, 
or else columnwise, in ascending order. 


Example. y = f(x) = x 2 —x 2 — 4.c-f 4. 


X 

-10 


-3 

-2 

-1 

0 

1 

2 

3 

4 

5 

6 


10 

y 

-1056 


-20 

' 

0 

6 

4 

0 

0 

i 

10 

1 

36 

84 

160 

i 

864 


A B CDEFGH 


However full the table is, it is always possible to 
increase it, either by extension or by interpolation. By 
the latter is meant inserting new values of x between 
adjacent values of x already found, yielding the correspond- 
ing y. What, in this example, are the values of y within 
the interval \<x<2 ? For each such value of x it will 
be found that y is negative. 

Tables such as these were familiar to the mathematicians 
of the sixteenth and early seventeenth centuries, who 
developed great skill in devising methods for elaborating 
them : their aim was to render tables of trigonometrical 
and logarithmic functions as complete as possible. But 
a great advance in insight was gained in 1638 when 
Descartes published his method of coordinates, whereby 
any such table could be thrown into geometrical form by 
plotting points A, B, C, ... whose coordinates were x 
and y } one such point for each pair of entries. These 
points are distinct and separated : they form a dis- 
continuous graphical representation of the function. 

'ot lie at random, but suggest a curve which 
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runs smoothly through them in the order indicated from 
left to right. 

The greater the number of entries in the table within 
any given interval AH the more clearly will the correspond- 
ing points suggest a curve : and whatever smooth curve 
is drawn through them it will be an approximate 
representation of one precise curve which par excellence 
represents the function. Suppose that this precise curve is 



whatever pair of corresponding values x, y are taken, the 
point P representing them will lie on the curve. We shall 
call this the graph of the polynomial /(x), or the curve 
whose equation is y = /(x), or briefly the curve y = /(x). 

Every polynomial with real coefficients has such a 
graph. The curve extends indefinitely to the left and 
to the right. It must cut any straight line drawn parallel 
to the axis of y in one point and one point only, since y 
takes one definite value for each value of x. The curve 
is continuous, as will shortly be proved ; it has no gaps. 
Its curvature is continuous ; it has no sudden corners, 
but bends smoothly. It crosses a straight fine drawn 
parallel to the axis of x not more than n times. If n = 1 
it is a straight line. If n — 2 it is a parabola, upwards 
or downwards, w ^ with its axis parallel to 0 y. Such 
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a curve, we say, has one bend. If n — 3 it has two bends. 

If n = 4 it has one or three such bends. If n = 5 it has 
two or four. In general if n = 2r it has an odd number 
of bends less than n, and if n = 2r+l it has an even 
number of bends less than n. For the purpose of drawing 
polynomial graphs, or of visualizing them, these are useful 
facts to bear in mind. The reasons underlying them 
depend on the properties of the derived polynomials 
f'(x), /"(#), and of the fundamental theorem that every 
equation has a root. 

By the first, second, ... derived polynomials we mean 

f’(x) = f"(x) — and so on. Thus 

y = f(x) = a 0 x n +a 1 x n ~ 1 J r a 2 x n ~ 2 +a„^a: +a n , 
% = f'W = na o x ”~ 1 + ~ l)Oi* n_!! + . . . +2a n _2a: +a B _i, 

= f"(x)=n(n-l)a 0 x n -*+(n-l)[n-2)a 1 x n -*+...+2a n _ 2 , 

and so on. Each such derivative is a polynomial of degree 
less by unity than its predecessor. Accordingly the 
(n— I)** derivative is a linear polynomial, while the n th 
derivative is the constant d n y\dx n = n\ a 0 , which is non- 
zero. All higher derivatives must vanish. 

Example. 

y = j(x) — x z — x 2 — 4x 4 - 4, y' — J\x) — 3x 2 — 2x— 4, 
y ” = /"(*) = 6x-2, y'" = /'"(*) = 6, /^(x) = 0. 
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CONTINUITY AND EVALUATION OF 
POLYNOMIALS 

9. Continuity of a Polynomial Function. Let (x, y) 

be the coordinates of a point P, regarded as fixed, which 
lies on the polynomial graph y ==f(x). Let Q be another 
point (x+h, y+k) also on the graph, so that y+k — f(x+h). 
We shall say that the function f(x) is continuous at a; if we 
can make k as small as we please by taking h small enough. 
More precisely, if € is a given positive number however 
small, then k must lie between — e and c (that is —e<k<e) 
whenever h lies between —rj 
and rj, where rj is a positive 
number (usually small) which 
can be ascertained when e is 
given. This statement, a little 
difficult at first sight, turns out 
on reflection to agree with our 
idea of continuity for a curve ; _ 

but it has the merit of giving O 
in precise quantitative form a 
test for what seems to be a qualitative property. Since a 
value of y exists for each value of x there can be no gap 
in the curve as traversed from left to right ; but there 
might conceivably be one or more gaps vertically. 
Suppose, in su,ch a case, that P is the last point on the 
curve before such a gap, and Q the first point after the 
gap ; then we have merely to choose e less than the 
distance PQ in order to show that such a curve does 
not fulfil the quantitative test just given. 

When the test in question is satisfied at each value of 
17 B 
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x ( — oo<x<oo) we say that the function is continuous 
throughout, and so too is its graph. It will now be proved 
that this is always true of a real polynomial f(x). 

Consider first the monomial function y — x n . Then 
y+k = (x-\-h) n t so that 

k = (x-\-h) n — x n = h(nx n ~ 1 -\-n {2) hx n ~ 2 +^ n_1 ) = hA f 
where n {2) = n(n — 1)/2!, w (f) = n(?i— r-\-l)/r !, a 
notation for the binomial coefficient. Each number x, h t 
n , n (r) is finite : the series has n terms and each term has 
a finite number of factors. Consequently A is finite. 
Hence k can be made as small as we please by taking h 
small enough. If k is to be made numerically less than € 
we simply choose h numerically less than e/B, where B is any 
non -zero constant which is numerically greater than the 
finite but variable A. Thus the function x n is continuous 
throughout. Exactly similar remarks apply to a 0 x n , to a 
sum of a finite number of such terms, and so to a poly- 
nomial in general. 

10. The Zeros of f(x), f'(x) and f"(x). The values 
of x for which f(x) vanishes are called the zeros of f(x) ; 
they are the roots of the equation f(x) = 0. At each 
zero of f(x) the value of y is zero ; hence if x is a real root 
the corresponding point (x, y) of the curve lies on the axis 
of x. For example the three points B, E, F of the figure 
in 8 correspond to three real roots of the cubic equation 
there represented, for which x — —2, 1 or 2. 

From the differential calculus we know that /'(x) gives 
the gradient of the tangent at (x, y) to the curve y = /(x), 
and that /'(x) vanishes at each point where the tangent is 
parallel to the axis of x. Such points are called turning 
points. At other points on the curve f'(x) is either positive 
or negative. If positive, /'(x) >0, then at such points the 
curve is ascending, as x increases in value from left to right : 
if negative, /'(x)< 0, the curve is descending as x increases. 

Again, f(x) is positive whenever f(x) increases, that 
is, the gradient steepens, which always implies a bend of 
the curve with its concavity upwards ; and /*(x) is negative 
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whenever f'(x) decreases, which happens when the bend 
of the curve has its concavity downwards. Each real 
zero of f"(x) corresponds to a point of inflexion on the 
curve, whenever f"'(x)^ 0. Such points separate the 
upward from the downward bends of the curve ; these 
bends naturally must occur alternately. Usually the 
tangent at an inflexion is not parallel to the axis of x y but 
when it is, the point is both inflexion and turning point. 
The origin in the curve y — x 3 is a good example of such a 
point ; and the reader should plot and draw the curve for 
inspection. Other turning points, for which f'(x) = 0, 
0 are classified as maxima or minima points on the 
curve : the maxima are at the crests of the waves and the 
minima at the troughs. 

Always reading from left to right (that is, with increasing 
x) we may sum up as follows : 


The curve y — f(x) rises 
falls 

has a turning point 
a maximum 
a minimum 
is concave downwards 
is concave upwards 
has a point of inflexion 


when f'(x)> 0, 

.. /'(*)< 0 , 

.» /'(*) = 0 , 

„ /'(*) = 0 ,/'(*)< 0 , 

„ /'(*) = 0,f(x)>0, 

/"(*)< 0 , 

>» /"(*)> o, 

/"(*)= 0 , /"'(*)*<>. 


1. Prove that the curve y — x 3 — x 2 — 4x + 4 of 8 has a 
maximum between C and D, an inflexion between D and 
E, and a minimum between E and F. (Maximum at 
x = —0-87, inflexion at x — -f i, and a minimum at 
x = 1*54.) 

2. The curve for which f"(x) — 6£ 2 -f3:r + l has no 
inflexions. 

3. The curve y = x*+x* +x 2 +ax-[-b has no inflexions: 
while y = x A -\-x 3 — §x 2 -\-ax-\-b has two inflexions. 

4. Examine the case when both f'(x) = 0, /"(a?) = 0. 


11. Behaviour of a Polynomial at Infinity. It is 
useful to consider the behaviour of the polynomial f(x) 
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for large values of x , positive or negative. For this purpose 
let the function be provisionally written as 

f(x) = a 0 x n ±a 1 x n -'±a 2 x n -*±...±a n , . (1) 

where the sign of each term is fixed, either + or — , and 
each of a v a 2) ..., a n is either positive or zero. If x>0 
each power of x is positive, and the value of f(x) obviously 
is not greater than 

a 0 x n +a 1 x n - 1 +a 2 x n - 2 +...+a n . . (2) 

nor less than 

a Q x n -a x x n ~ x — a 2 x n - 2 -...-a n . . . (3) 

Also if z>l, then x n >x n ~ 1 >x n ~ 2 >... >x>l. Hence 
a f a; n “ r is less than whenever r = 2, 3, ..., n. Conse- 

quently, if x>l, 

a<F n + (Oi +a 2 + .. . . +a n )x n ~ 1 

> f{x) > a 0 a: n -(a 1 +a 2 + ...a n )^-i. 

Hence, if A = a 1 +a 2 +...-\-a n , we have 

(a Q +Ajx)x n > f(x) > (a 0 —A/x)x n . . . (4) 

Now choose a positive constant e, however small. Since 
A is finite and positive, we can ensure that A/x<e by 
taking x>A/€ ; that is, by taking x large enough we 
ensure that f(x) lies between the values y x = (u 0 +e)x n 
and y 2 = (a 0 — e)x n . 

Geometrically, the graph of y must lie between those 
of y x and y 2 for large enough values of x. Let the graph 
of y 3 be drawn, where y 3 = a 0 x n , this being the limit of 
both y x and y 2 when €->0. The graphs y v y 2 , y 3 then form 
two very acute curvilinear angles running from the origin 
and bending upwards into the first quadrant if a 0 > 0, 
and downwards into the fourth quadrant if a 0 <0. Also 
y x is uppermost, y 2 lowest and y 3 between. For all large 
enough values of x the curve y is between the uppermost 
and the lowest of these boundaries — the shaded area of 
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the figure. By changing e to or any such proper fraction 
of itself we decrease y v increase y 2 and leave y s unchanged. 
As €—>*0 the graph y approximates more and more closely 
to that of y 3 , which acts as a curvilinear asymptote to y. 

Analytically we say that for large values of x the 
function f(x) behaves like a 0 x n > its leading term. 



Similarly, if x is negative and large, f(x) behaves like 
a 0 x n . If n is odd, x negative and a 0 positive, then a 0 x n 
is negative, so that the curve is situated in the third 
quadrant. If n is even the curve is situated, for large 
negative values of x , in the second quadrant. 

Example. f(x) = x*— x % — 4x + 4. 

Here the sum of the absolute values of all coefficients 
except the first gives A = l-f4 + 4 = 9, so that f(x) lies 
between (1 — e)x* and (1 -f e)x z whenever x>9/e or < —9/e. 

12. The Taylor Expansion of a Polynomial. A 

particular case of an important theorem based on repeated 
differentiation, first employed by James Gregory in 1670 
but first published by Brook Taylor in 1715, is the following 
identity for a polynomial /(a?) of degree n : 

f(x+h) = f(x) +hf'(x) +h 2 /"(z)/2!-f-.- . +h n J M (x)jn\, (1) 
where f lT) (x) denotes the r th derivative of f(x). 
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Proof. The polynomial f(x) is the sum of terms of 
type a r x r . Putting f(x) = ajc r in the right-hand side 
of (1) and making use of 

d k 

— x r = r(r— l)...(r — k-\-\)x r ~ k 9 
ax k 

we obtain 

a r {x r +hrx r ~ 1 +h 2 r i 2) x r ~ 2 + . . .h n r in) x r - n }, 

where r ik) denotes, as in 9, p. 18, the binomial coefficient 
r(r— l)...(r— &+l)/fcl. But this is the binomial expansion 
of a r (x-\-h) r . Hence the theorem in question is true for 
each term of the polynomial f(x) ; and so, by addition of 
terms, for the polynomial f(x) itself. 

Examples. 1. If f(x) — ax 3 +3bx 2 -}-3cx + devaluate/ (x ±h) 
and f(x—b/a). 

2. If f(x) = x 3 — 2x 2 + 3x — 4 evaluate f(x -f 2) and f(x — 1). 

13. Identities and Equations. We must distinguish 
carefully between an identity and an equation. When 
a polynomial or other function f(x) is thrown into another 
form <f>{x) the relation f(x) = (f)(x ) is called an identity, as 
for example 

2x 2 -3x+l = (2x-l){x~l). 

An identity is characterized by the obvious property 
that when it is simplified to its basic terms they all vanish. 
Identities range from arithmetical cases, when no variable 
x appears, to highly elaborate cases involving many 
variables. It is also obvious that an identity is a relation 
which is true for every finite value of such variables. In 
the above instance, whatever finite value x has, the relation 
is true. But a much more important and converse fact 
holds, namely, that if 

f(x) = a 0 x n +a l a; n ~ 1 + . . . -fa n = 0 
is true for every finite value of x then f(x) = 0 is an identity. 
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First Proof. This proceeds by the use of determinants. 
Choose 7i + l unequal numbers a, ft, k and form the 
alternant 

A = |a n /3 n-1 ...K°| 

in a notation where 

|a 2 £ 2 y 2 

, |a 2 ^V°l = i a P y 

II 1 II 



and so on. Then /(a) = a 0 a n +...+a n = 0, and we have 
7& + 1 homogeneous linear equations /(a) = 0, f(ft) = 0, 

/(k) = 0 for the a 0 , a v ..., a n in terms of a, ft, k. By 
the theory of linear equations (Aitken, Determinants and 
Matrices , p. 64) either A = 0 or else all the a { vanish. 
But A is equal ( Ibid ., p. 41) to the continued product of 
the \n{n-\-\) differences of a, ft, ..., k taken in pairs, and 
so cannot vanish, since all of a, ft, ..., k differ. Hence 
all the a { vanish, so that/(x) = 0 is an identity. 

A corollary follows at once, that if f(x) is a polynomial 
of degree n which vanishes for 7i + l distinct values of x it 
vanishes identically for every value of x. 

The argument given above is so important that it will 
be worth while to illustrate it for the case of the cubic 
aQX z -\-a x x*-\-a 2 x-\-a z = 6. 

If a, ft, y, 8 were four distinct roots then 

a 0 a 3 + a x a 2 + a 2 a + a 3 = 0 
a 0 ft 3 + a x ft 2 + a 2 ft + a 3 = 0 
a 0 y 3 + a iY 2 + « 2 y + » 3 — 0 
a 0 8 3 + cq8 2 + a 2 8 + a 3 — 0. 


On eliminating a 0 , a 1 , a 2 , a 3 from these equations we have 


1 


P 3 P P 1 I _ o 

a a i l — 


A(aftyh) — | o 2 i i 

I y 3 y‘ y i I 
| 8 3 S 2 8 II 


But A(afty8) = (a — j8)(a — y)(a — 8)(ft — y)(ft — S)(y — 5), and so, 
since ^1=0, two roots at least must be equal, contrary to 
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hypothesis. It follows that the assumption is wrong, and 
that the cubic equation cannot have more than three distinct 
roots. In the same way an n-ic polynomial equation cannot 
have more than n distinct roots. 

Second Proof . If f(z) = 0 for every value of x, f(x) 
neither increases nor decreases in any range whatever, 
and so f'(x) — 0 for every value of x. But then, by the 
same reasoning, f"(x) = 0 for every value of x , and hence 
again f'"(x) = 0, and likewise all derivatives vanish. But 
since the Taylor series of f(x) with respect to x = 0 is 

fi x ) =/(0)+z/'(0)+z 2 /"(0)/2!+...+a; n / < " ) (0)/tt!, 
we see that 

«» =/(0), a„_ i=/'(0), a n _ 2 — /"(0)/2l, ..., a 0 = /<»>(<>)/»!. 

Hence all the must be zero. 

It follows from this theorem that if f(x) = <f>(x) is an 
identity, true for every value of x> we may equate the 
coefficient of each power x r on the left to that of x r on the 
right. In fact we merely write f(x)—cf)(x) = 0 and apply 
the theorem. From an algebraic identity involving 
polynomials of the n th degree we thus obtain n+1 arith- 
metical identities. The identities so obtained may range 
from the most obvious tautologies to highly elaborate 
arithmetical propositions. 

Unless all the a { vanish f(x) = 0 is an equation of degree 
given by the index of the highest power of x whose co- 
efficient does not vanish. Hence, by the above theorem, 
an equation of the n th degree cannot have more than 
n roots. 

A polynomial identity can be described as having degree n 
when terms up to the power x n occur in it. If so the identity 
must be taken at its face value— the description is given 
before simplification is undertaken. Thus 

(a;+l)(a;— 1)— a? a 4-l = 0 
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is a quadratic identity. On the other hand, 

(x + l){x-\)--x 1 +2x — 0 
is not a quadratic but a linear equation. 

14. The Practical Evaluation of a Polynomial. 

It is manifestly of importance to acquire facility in 
calculating the value of f(x) for special values of x. 
How, for example, can we best evaluate /( 4) when 
f(x) = 2a; 3 — 3a; 2 +4#— 5 ? Instead of direct substitution, 
which is often troublesome and liable to errors, the 
following process, introduced in 1819 by Horner, is to be 
recommended, particularly as it furnishes the technique 
required also for the actual numerical solution of the 
equation f(x) = 0. The work is arranged as follows : 

f(x) — 2a; 3 — 3a; 2 +4a;— 5. 

2 —3 +4 -5 (4 

8 20 96 /(4) = 91. 

5 24 91 

The coefficients 2, —3, 4, —5 with their proper signs 
are placed in a row according to descending powers of x. 
Missing powers must bo indicated by zero coefficients. 
Thus we have four columns {n+\ in general). The 
argument 4 of the desired /( 4) is then entered, as if it were 
the quotient of a long division sum, beyond the final 
column. Beginning at the left the leading entry 2 is now 
multiplied by 4, the product, 8, being written below the 
next entry on the right and added. The result 5 is again 
multiplied by 4 and the product, 20, is added to the 
coefficient in the third column, giving 24. This again is 
multiplied by 4 and the product, 96, is added to the 
coefficient in the fourth, the last column. The process is 
then complete and the result is/(4) = 91. 

The explanation is simple : and it is only necessary 
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to carry out the same procedure with letters to convince 
ourselves of its correctness : — 

a b c d (x 

ax ax 2 -\-bx ax*-\-bx 2 +cx 
ax+b ax 2 +bx-{-c ax 3 -\-bx 2 -\-cx-\-d = /(#). 

According to the rule the last entry in the final column, 
ax 2 -\-bx 2 ~\-cx-\-d } should be the value of the polynomial 
whose coefficients in order are o, b, c, d and whose argument 
is x , and this is indeed the case. The method depends on 
the simple identity 

f(x) = ({ax-\-b)x-\-c)x-\-d y 

where brackets are introduced so as to resolve all powers 
of x into single factors. This can obviously be done for 
the general polynomial of degree n. 

The reader will find it instructive to divide 
2x 2 — 3x 2 -f4x— 5 by x— 4, using long division. The 
quotient is 2x 2 +^a:+24 and the remainder 91. Horner’s 
method (sometimes called the method of synthetic division) 
provides all the materials for the quotient and remainder 
more compactly than the ordinary method : and it has 
the psychological advantage of employing addition rather 
than subtraction as the staple operation. 

Horner’s method yields the quotient and remainder 
at once whenever a polynomial f(x) is divided by a linear 
divisor x— a. For the more general divisor ax-\-j3 , divide 
f(x) first by a and then by x + /8/a. This yields the correct 
quotient, but the remainder R is relative to #+j8/a. It 
must be remultiplied by a to give the correct remainder. 

The method does not apply to quadratic and higher 
divisors. 

Examples. If f(x) = x 6 — 2z 4 + 3, find /( — 3). 

1-2 0 0 0 3 (-3 

-3 15 -45 135 -405 

-5 15 -45 135 -402 =/(- 3). 

Find also/(4),/(0*2),/(3), and try other values of x. 
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15. The Graphical Method of Lill. An interest- 
ing graphical method of evaluating /(a), devised by Lill, 
has an advantage when the coefficients of the polynomial 
are awkward decimals, but an approximate result alone 
is required. Squared paper is useful. 

We lay off straight lines AB, BC, CD, DE, ... of lengths 
equal to the respective coefficients a 0 , a v a 2 , a 3 , .... At 
each point B, C, D, ... there is a right angle corner between 
consecutive lines, to the right if the next coefficient has 
the same sign and to the left if the sign changes. The 
whole track ABODE... may be regarded as a plan of a 
route through a rectangular system of streets. We shall 
suppose that none of the coefficients vanish, so that there 
are n + 1 segments and n corners to the route. Such a 
graph is clearly a representation of the function, a new 
type but none the less a representation. 

Now take an acute angle 0, such that tan 0 = x : 
then one such angle corresponds to a given real x , positive 
or negative, and the sign of 9 is the same as that of x. 
Draw a line AP meeting BC (produced if necessary) at 

P, and such that the angle BAP is 9. If AB is set 
off horizontally from left to right, then P falls below 
AB when x is negative and above when x is positive. 
Draw a broken line APQR... such that the angles P, 

Q, ... are right angles situated on the lines BC, CD, DE, ... 
respectively. Then the value of f(x) is given by the 
distance RE, from the last such point R to E, the end of 
the final segment a n . This distance is measured according 
to the sense of the distance DE : thus in the first illustration 
DE is +4, the direction of measurement is from D to E, 
hence RE is positive. In the second illustration DE is 
—3, therefore RE, having the same sign as DE, is negative. 

To prove the result we work out the segments step by 
step. Thus in the first figure AB = a 0 , BP = a 0 tan 0, 
PC = a x -\-aQ tan 9 — a Q x-]-a v But QC = PC tan 0, hence 


QD = CD— CQ = a a +(a 1 +a 0 a:)a;. 







GRAPHICAL METHOD 


29 


Finally, RE = DE— DR = DE— DQ tan 0 
= a 3 +QD x 
= a 0 x z +a 1 x 2 +a^-\-a s . 


The process can be applied 
to all such cases and for all values 
of n. 

Evidently the equation is 
solved whenever the point R 
coincides with E, for in this case 
RE vanishes* It requires but 
little practice to see that this 
would happen, in our first illus- 
trative example, when 6 is rather 
more than 45°. This furnishes 
an interesting and ingenious 
method for locating the roots of 
an equation. 



Examples. 1. Adapt the rule I 
to the case where one or more of { 
the intermediate coefficients are E 
lacking. x* + 2x—5. 

Proceed as before but make the 
necessary points coincide. For x* + 2x — 5 we have AB = 1, 
CD = 2, DE = 6. Whether BC is regardod as +0 or -0 



A 


x 3 -j- 2x ,s — 5. 
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the rule requires CD to be in BA produced, a right-about or 
a left-about turn. 

2. Justify the preceding figures for x z + 2x — 5 and for 
x*-j-2x 2 — 5, where f(x) is represented by RE with x — 1 . 

3. Show that x z -\-2x — 5 = 0 has a root which is slightly 
greater than unity. (Move P slightly upwards to cause R to 
coincide with E.) 

4. Draw the zigzag graphs ABC... for a; 2 — 4, x 8 -f-8, 
x* -f x 9 -f 2x 2 + 2x + 1 . 

5. Show from the zigzag that 2a: 4 — 2a; 3 -J- 4a; 2 — 4a; -J- 1 = 0 
has a root between 0 and 1. 

16. Horner's Method of Reducing* an Algebraic 
Equation. For the purpose of solving a numerical 
equation by successive approximations — a method which 
is exemplified in ordinary arithmetic when we calculate 
y/2 as 1*4142..., digit by digit — we need a practical way 
of expressing f(x) as a polynomial g(y) where y = x+a. 

Thus f(x) = a 0 x n ^a l x n ~ x -\-...-\-a n _ 1 x-\-a n . . (1) 

and g(y) = b 0 y n +b 1 y n ~ 1 + ...+b n _ 1 y+b„ . . (2) 

are to be identically equal when y — x—a. The co- 
efficients a { are supposed known ; so too is a. We require 
to find the coefficients b { . 

This can of course be done by Taylor's theorem ; in 
fact 

9(y) = /(*) =f(y+a) =f(a)+yf'(a)+...+y n f M (a)ln\, (3) 

which exhibits /(x) as a polynomial in y of degree n. Hence 
b n — /(a), 6 n _ x = /'(a) and so on. Incidentally this shows 
that the degree n is the same both for g(y) in y and f(x) 
in x. Theoretically, therefore, the question is solved, but 
we still need a convenient practical way of calculating the 
coefficients b { . This is done by Horner’s method. 

Suppose, for example, that we wish to express 

f(x) = 2X 4 — 3x 3 +4x 2 — 5x+6 

in the form 

6 0 (x — 2) 4 +6j(x— 2) 3 +6 2 (x— 2) * +b z {x— 2) +& 4 . 
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We apply Horner’s method, dividing f(x) by a;— 2. From 
the Horner scheme 

2 -3 4 -5 6 (2 

4 2 12 14 

f 6 T 20 

we infer that the quotient is 2a; 3 +a; 2 -f 6a;+7 and that the 
remainder is 20. In fact 

2a; 4 — 3x 3 +4a; 2 — 5a;+6 = (2x 3 -\-x 2 -{-Qx-{-l)(x— 2)+20. 

The reader will find it instructive to reverse the process 
and start by multiplying out this identity. The above 
scheme will then be seen from another point of view. 

Now let 2x 3 -\-x 2 -\-6x ~\ -7 be treated similarly and divided 
by x— 2. From the scheme 

2 1 6 7 (2 

4 10 32 

5 16 39 

we infer that 2a; 3 -fa; 2 + 6a; +7 = (2a; 2 -f 5a; +16) (a;— 2) f 39. 
Again, from 

2 5 16 (2 

4 18 

"9 34 

we infer that 2a; 2 +5a;+16 — (2x-\-9)(x— 2) +34, 
and finally from 

2 9(2 

ji 

13 

that 2a; +9 = 2(x-2)+13. 

We can telescope these results and write 
2a: 4 —3a; 3 +4a; 2 —5a; +6 = (((2y + 13)y+34)y+39)y+20 
= 2y 4 +13y 3 +34y 2 +39t/+20, 
where y = a;— 2, and this is the required form. In fact 
b 0 = 2, b x = 13, b 2 = 34, b z = 39, b A = 20. 
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We notice that the successive remainders have furnished 
the coefficients b from right to left, except for 6 0 , which is 
the same as a 0 . Clearly too the process is true in general : 
the b { are the remainders on dividing by y (= z—a) first 
f(x) and then its successive quotients. 

Furthermore — and this is the chief advantage of the 
method — we can tabulate the whole process in one 
scheme : 


—3 

4 

1 

jt 

4 

9 

4 


4 

2 

6 

10 

16 

18 


-5 

12 

7 

32 


6 (2 

i4_ 

|20 


34 


[39 

f(x) = 2x 4 ~-3x 3 +4x 2 ~5x-\-6 
= 2y 4 + 13y 3 +34y 2 +39y +20, 
where y = x—2. 


13 


It will be seen that this scheme incorporates all that has 
been said, but without needless repetition. The method 
is easy to memorize directly from such a scheme. Build 
the scheme from left to right by rows. At each step 
multiply the balance in a column by a(a = 2 above) and 
add it to the balance in the next column to the right. 
Perform the step once in the last column, twice in the last 
but one, and so on. Rule off with a vertical line at the 
completion of each stage. 


Examples. 1. 


14 0 - 

-70 (3 

3 21 

63 

7 21 

- 7 

3 30 

f(x) = x*-\-4x t —70 

10 [61 

= 2/ 3 + 1 3t/ a + bly 

J 1 

where y = a — 3. 

13 
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2. Prove by Homer’s method that 

x*-l = (x + l) 4 _4(a; + l) 3 -f6(a;-h 1 ) 2 -^-H). 

3. Express f(x) as a polynomial in y when 

f(x) — x 3 — 4a; 2 -f 70 and y — x — 3 ; also when y — a? + 3. 

4. From Example 1 obtain (51, 20, 6). 

5. Evaluate (2a: 5 — 3a; 4 + 5a: 2 — 6a; + 7) -r (x — 2), -r (a:-j-2), 
and again (a;-f-3). 

6. Obtain a, 6, c, d, e, / from the identity 

a;®-Hz 3 + l = (x-2)*+a(x-2)*+b(x-2)* 

-\-c(x — 2) 3 -\-d(x — 2) 2 ~\~e(x — 2) -f/. 

7. If a; 8 - 4a: 2 -f- 8a:- 1 = 

(x — l)(x — 2)(x — 3)+a(x — l)(x~2)+b(x — l)+c, 
find a, b , c. 

1 — 4 8 -1(1, 2, 3 

1 -3 5 

^3 5 4 = o 8 - 

2 IZ? 

3 = b 

3 

2 = a 


Three changes of sign : all 
corners to the left. 

( 6 = 45°, a; = 1). 

Draw the track when x = ij. 

9. Reduce £4, 2s. 8d. to pence by Horner’s Method. 

4 2 8 

80 984 
82 992 

Thus Homer’s Method is simply the systematic use of a very 
familiar process in elementary arithmetic. 

C 


/ 
' e 


A B 


x 3 — 2x 2 + 3a;— 4. 
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THE THEORY OF RATIONAL FUNCTIONS 

17. The Division of Polynomials. The arithmetical 
process of division plays two distinct roles : it is either 
a sharing , or else a measuring , process. To divide 28 inches 
into 7 equal parts is a sharing process, to find out how 
many times a length 7 inches is contained in 28 inches 
is a measuring process, but the same arithmetical symbol 
28-1-7 is used for both. When the exact multiple 28 of 7 
is replaced by a number such as 23, which is not a multiple 
of 7, the distinction is more evident : sharing leads to a 
proper fraction 2/7 as remainder, while measuring leads to 
a whole number remainder 2. The quotient 3 is the 
same in either case. 

23 +7 ; 23/7 = 3 +2/7, 23 = 3 X 7 +2. 

Measuring division may be regarded as repeated subtrac- 
tion, proceeding just so far as to produce a zero, or else 
a final remainder which is less than the divisor. 
Subtracting 7 step by step, we have 23, 16, 9, 2. The 
number of terms which are not less than the divisor gives 
the quotient, while the term, if any, which is less than 
the divisor is the remainder. 

If D = dividend, d = divisor, q = quotient, 
r = remainder, then we can state these facts in the form 

D-^rd ; D/d = q+rjd, D = qd-\-r. 

When D and d are positive integers, so also are q and r, 
with the important condition that 0<r<d. 

In algebra these statements are closely paralleled by 
the properties of polynomial division. A whole number 

34 
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such as 384 is really a quadratic polynomial in respect 
to ten : 384 = 3a: 2 + 8a: +4, when x = 10. A decimal 

terminating fraction such as 384-25 is really a rational 
function of ten, expressed as partial fractions, 

2 5 

384-25 = 3 a: 2 +8z+4H ( * when x — 10. 

x x 2 

A non- terminating decimal is an infinite series in descend- 
ing powers of ten. Thus the polynomial f(x) in algebra 
plays the part of the whole number in arithmetic, while 
the rational function 

R[x) = f(x)/<f>(x) 

composed of two polynomials, whose degrees are m and n 
respectively, plays the part of the arithmetical fraction. 

Let f(x) = a 0 x m +a 1 £ m ~ 1 + . . . +a m _ 1 z +<i m , a 0 ^0, 

<t>{x) = c 0 x n +c l x n ~ 1 +...+c n _ 1 x +c„, c 0 ^0, 

where all the coefficients a { and are constants. If 
ra^n the function R(x) is called an improper, and if 
m<n a proper , fraction with regard to x. If m^n we 
may divide f(x) by <f>(x) and obtain 

f(x)l<f>{x) = q(x)+r(x)l<f>(x), f(x) = q(x)<f>(x)+r(x), 

where q(x) is a polynomial of degree m—n, and r(x) is one 
of degree less than n. For example 

x 4 f 2x 3 — x+l __ /nA t ON f 2x 2 — 5 

x *-2x+3 ““ + x z-2x+# 

x 4 -f-2x 3 — x+l = (x+2)(x 3 — 2x+3)+(2x 2 — 5). 

Let us note the following properties : (i) R(x) reduces 
to a polynomial when n is zero, or else when <f)(x) is a factor 
of/(x), in which case r(x) vanishes identically. 

(ii) The highest term in q(x) is a 0 c 0 ~ 1 x m '- n , as is seen 
by actual division, so that the degree of q(x) is m—n . 

(iii) The relation f(x) — q(x)cf>(x) -\-r(x) is identically 
true for every value of x. 
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This division process is unique ; for if not, let q x 
and r x be another quotient and remainder of so that 

f(x) = q<f>+r = q 1 (f>+r 1 , 

where, for brevity, q is written instead of q(x), and so on. 
Then (q—qi)</> = r—r 1 identically. If q^q x the left-hand 
expression is a polynomial expression of degree at least 
that of <f>, whereas the degree of both r and r x and therefore 
of their difference is less than that of <£. This (c/. p. 24) 
is impossible : hence q — q v so that r = r v and the division 
is unique. 

By Euclid’s method of repeated measuring, commonly 
called the G.C.M. or the H.C.F. process, we can discover 
whether f(x) and cf)(x) possess a polynomial factor in 
common. Again, for brevity, let single letters denote 
polynomials. Divide / by <£ giving remainder r : divide 
<f> by r giving a remainder r x : divide r by r x giving a 
remainder r 2 , and so on. Thus 

f=q<f>+r, 

<t> = <h r + r v 

r = q tfi+r* 


fp-2 2p r »-l+ r »- 

Each of/, (f> t r, r v ... is a polynomial, where the degree 
of r is less than that of (f> , that of r ± is less than that of r, 
and so on. Consequently the process must terminate since 
the degree of <j> is finite. Either a remainder r v vanishes 
identically, or the degree of the last r v is zero, that is, 
r v is a constant c^O. 

Case 1. If r v = 0 and all its predecessors are non -zero, 
then r v -. x is a factor of r„_ 2 . But r p _ 3 = g p _ 1 r p . 2 +r p . 1 , 
so that is also a factor of r p _ 8 , and so on throughout 
all the equations. Thus r v - x is a factor common to /, <f> 
and to each r. 

Conversely, any factor 8 common to / and <j> is a factor 
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of f—q<f> and therefore of r, similarly of <f>— q^r, that is 
of r v and so on. Hence r v ^ includes every common factor 
of / and <f > : it must therefore be the highest common factor 
(H.C.F.) or greatest common measure (G.C.M.) of/ and <f>. 

Case 2. If the last remainder is c, a non-zero constant, 
it must include as before every common factor of / and <f>. 
Since c is free from x we say that, in this case, / and (f> are 
polynomials which are prime to each other. They have 
no common factor. 

Hence any two polynomials are either prime to each 
other or else have a common polynomial factor. The 
above Euclidean process is unique and rational ; and it 
will inevitably detect any factor common to / and <f>. 
We denote the G.C.M. by G, and in Case 2 we take G — 1. 

Theorem 1. Each partial remainder r, including G, 
can be expressed in the form Af+Bcf), where A and B are 
polynomials of degree less than <f> and / respectively. 

Proof. Solving the relation for r, r v r 2 , ... in succession, 
we obtain 

r = f-q<f> = A 0 f+B a <f>, say, 

r i = —?]/+(?? i + 

= (M2+ 1 )/-toi?8+2+?s)^ = A 2 f+B4, 
and so on. In this way the row r k is obtained on multi- 
plying the preceding row by —q k and adding the last but 
one row. Hence each partial remainder is expressed in the 
form Af+Bf), where A and B are polynomials. 

Again, by the mode of formation, A k the coefficient 
of / contains as highest term ±gjg 2 . ,.q k , while that of 
Bk i s N° w from (1) the degree of / is that 

of qtf), the degree of <f> is that of q x r : indeed the continued 
products 

f<f>r...r p _ 2 , q<f>q 1 rq 2 r l ...q p r J) _ l 

have the same degree. Remove the common factors : 
therefore / and qq^^-q^v-i ^ ave the same degree. But 
r j>-i> being a remainder before the last, must involve x 
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and have a positive degree, so that the degree of qq v ^q p 
is less than that of /. 

Once more, on applying the same argument to all 
but the first equation of (1), the degree of q 1 q 2 ---q J > is less 
than that of <j). 

Accordingly the degree of every A is less than that 
of <f> y and of every B is less than that of /. 

Corollary. Polynomials A and B exist for any pair of 
polynomials / and <£, such that either Af-\-B(f) = G or 
Af+B<f> = 1, where G is the G.C.M. of / and <£, or else 
/ and <j> are prime to each other. 

The corollary is proved by applying the above theorem 
to the final non-zero remainder, which is either G or a 
constant c. In the latter case divide Af- \-Bcf) == c through- 
out by c and rename the polynomial coefficients of / and 
<£, as A and B. 

This theorem, with its corollary, is of fundamental 
impprtance in many branches of algebra. 

Examples, l.f — a; 3 41> (f> = £ 2 -fl. Here the successive 
division processes give 

a 8 4-1 = a:(a; 2 41)4( — * + 1) where q = x, r = — #41> 
a: 2 4 1 = (— a? 4 1 )( — ■ x ~ 1)42, ft = —x — 1, r x — 2. 
Hence / and <f> are prime to each other. Also 

x 2 41 — [z 3 41— a;(a; 2 41)]( — % — 1)42. 

Thus (x 41 )(« 3 41 )+(l-»-^ 2 )(^ 2 41 ) == 2, so that 
A( = £(#41)) is of degree less than that of <£, and 
B( «*( l~x —a; 2 )) is of degree less than that of /. 

2. / = a: 3 41> <f>~x 2 — 1. Here # 3 41 = x(x 2 — 1)4^4 1> 
x 2 — 1 = (x— 1 )(# 41 ). Thus G = x + 1, and (a; 3 4l)— a;(a; 2 — 1) 
= #4 1 shows that A = 1, B = —x, where Af-\-B<f> = G. 

18. Reduction to Lowest Terms. A proper fraction 
p(x)/q(x) cannot be identically equal to a polynomial f(x) ; 
otherwise p(x) — f(x)q(x) would be an identity, where at 
least one term on the left is of higher degree than any on 
the right and therefore ( cf . p. 24) could not cancel out. 
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The combination, in sum or difference, of a finite 
number of algebraic proper fractions always gives a proper 
fraction : for 


p + 1 ^ ps+qr 
q s qs 


( 1 ) 


and the degrees of both ps and qr are less than the degree 
of qs when pjq and r/s are proper fractions. For n such 
terms, add them one by one, applying this argument 
each time. 

In individual arithmetical cases this property is not 
necessarily true. 


Example. — - — - H — gives a single fraction whose 

r-fl a : 2 + l 

degree is four in the numerator and five in the denominator. 
Put x — 1 ; the sum is not a proper fraction. 

Theorem 2. If/ is prime to <f> then //<£ cannot be reduced 
to terms of lower degree. 

Proof. If possible let f\f> = fjfa, where f x is prime 
to fa. Divide / by </>, and f x by fa : let the respective 
remainders be r, r v Then 

4 > <£i 


or 




Zi 

<t>x 


where q — q x is a polynomial and the right-hand expression is 
a proper fraction. Hence both vanish, so that r/<^ = rjfa. 
Hence 

Sifi = <l>l<f>i = r/r 1 = .... 


This process has reduced <f>lfa to a fraction r/r x of lower 
degree in both numerator and denominator. Let it 
therefore be repeated until, say, r k /r k+1 is reached, where 
either r k or r k+1 is a constant which may be taken as unity. 
If r k+1 is a constant and r k is not, then / = f x r k , </) = far k 
shows / and both factorized, whereas by hypothesis / 
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was prime to (f > . Similarly, r k cannot be a constant while 
r k+1 is not ; so that both are constant, and so too is 
r k /r k+1 = A, say. Thus / = A f lt <f> = \<f> t and fl<j> can 
differ from by a constant factor only in numerator 
and denominator. Thus //<£ cannot be reduced to terms 
of lower degree. 

Theorem 3. The form Af — G or 1 is unique, 
where A and B are lower in degree than </>/6? and f/G 
respectively. 

Proof. If f and (f> have a common factor, write / = Gf lt 
(f) = G(f> 1 and treat the prime case Af x +1^ = 1. 

Assume now that / is prime to cf>. and that a second 
such identity Cf-\-D(f) — 1 exists. By division express G 
as Q<j>-\-A 0 and D as Pf+B 0 , where A 0 is of degree less 
than that of c/> and B 0 less than that of /. Hence 

(A 0 +Q</>)f+(B 0 +Pf)<l> = 1. . • (2) 

This is an identity, where fcf> is of degree m+ra, A 0 f is of 
lower degree, and so is B 0 (f). Unless P+Q — 0, the identity 
contains terms (P +$)/</> of degree at least m-\-n which 
cannot vanish. This is impossible (c/. p. 24). Hence 
P+Q = 0 and A 0 f+B 0 cf> = 1. 

Once more, A 0 and B 0 must be the A and B of the 
Theorem 1 ; for if not we have A 0 f -\-B 0 (f)=l=Af-\-B(f), 
so that (A 0 —A)f=(B—B 0 )(f>. Unless A = A 0 , B = B 0 
this reduces the fraction //<£ to lower terms (B—B 0 )I(A Q —A) i 
which is impossible when / is prime to cf> : thus A == A 0 , 
B = B 0 and the formula Af-\-B<f) = 1 is unique. 

19. Partial Fractions. Let <f>(x) be capable of 
breaking up into two polynomial factors <f> v <£ 2 which 
are prime to each other. Then, by the Theorem 1, we 
can find polynomials (or constants) A x and ^4 2 , of 
lower degrees than and (f> 2 respectively, such that 

2 = 1- This at once enables us to resolve 
f(x)/<f>(x) into partial fractions ; for 

M = MMMfj _ Af + Af. 

<f>( X ) $ 1 <t>2 


( 1 ) 
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On performing the division, if improper fractions occur 
in either of the pair, we obtain 


f(x) 

<f>(x) 


q(x) + 


fi( x ) 

4> i( x ) 


r 2 {x) 

H*)' 


( 2 ) 


where q(x) is the total quotient, and r l9 r 2 are the respective 
remainders, for the divisors fa and cj ) 2 . 


Example. 


2x* 


(* 2 -fl)(* + l) 


= 2x-2- 


x-l 1 

x-\-\ 


The reader will be familiar with simple examples of the 
method of partial fractions. Certain observations can 
therefore be shortly stated : (i) any common factors of 
/ and cj) are first removed before resolution into partial 
fractions is attempted : (ii) the quotient q(x) is of degree 
m—n and only appears if m, the degree of/, is not less 
than n, that of </> : (iii) fa and </> 2 must have no common 
factor, although each can be of any degree : (iv) if fa 
is of degree p the numerator r x (x) is a polynomial of degree 
less than p. 

If either fa or (f> 2 has polynomial factors the process 
may be repeated, until all the distinct factors of (f> are 
segregated. Thus we express the rational function E(x) 
as a sum of partial fractions in the form 


R(x) = 


f(x) 

<f>(x) 


= «(*) + 


E r — 

M’ 


( 3 ) 


where no two of the denominators have any factor 
in common, and each r(x) is of lower degree than its 
denominator. 

Various cases arise : 

(i) The simple case, when <f>(x) = ( x — a)(x—fi)...(x-X) 

is a product of n distinct linear factors in which a, A 

all differ. We write fax) a). 

(ii) The repeated factor case, when 

<f>( X ) = (a :-a) r (x-py ... , 
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where a, jS, ... all differ but one or more of r, s, ... exceeds 
unity. We write </>(#) = II(x—a) r . 

(iii) The simple quadratic case, when <f>(x) has quadratic 
factors x 2 -\-px+q, with or without linear factors, and 
p 2 <4q. 

(iv) The repeated quadratic case. 

In any case the polynomial quotient q(x) must be 
evaluated by an ordinary division process. In the simple 
case, (i), each partial fraction will take the form 
a/(x—a) y where the denominator is linear and the 
numerator, being of lower degree, is necessarily a constant 
only. Hence we have an identity 


4>[x) 



To evaluate a, multiply throughout by x—a . Every 
term on the right will then contain x—a as a factor except 
the term a itself. Put x = a and the right-hand member 
is a alone. On the left we shall have 


/(a)/{(a /3)(a— y)...(a — A)}, 

which must therefore be equal to a. Similarly for each 
numerator of the partial fractions. 


Examples. 1. 


a 

x+S + - + 

x — l 


b 

x^2* 


Here x 2 — 3# + 2 = (x — \)(x — 2) and q(x) — x + 3 by division. 
Multiply through by a;*— 1 and then put x = 1. Thus 

1 8 

= a. Similarly, - — -- = 6, so that a = —1, b — 8. 

1 — 2 2 — 1 


2. If <f>(x) — (x — a){x — fi)(x—y) where a, y differ, prove 


4>’(x) _ 1 1 1 

<j){x) X — a X — ft X—y 

3 . 

<j>{ x ) (x-a)<l>'(a) T (a:-A)f(A) 
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where <j>(x) = (x—a)(x — p)...(x—\). If f(x) is of lower degree 
than </>(x), the quotient q (x) vanishes. 

To prove this, observe that <j>'(x) = (x — fi)(x— y)...(x — A) 
plus terms all involving x — a as a factor. Hence <£'(a) 
= (a — /9)(a — y)...(a — A). Hence the value of a in the partial 
fraction a/(x — a) is f(a)l<f>'{a), and the result follows. 

4. If f(x) is a cubic polynomial and a, £, y, 8 are any four 
distinct numbers, show that 


f(x) = 2f(a) 


(x-P)(x-y)(x-8) 
(a — p)(a—y)(a — S) 


summed for the four terms due to the distinct combinations 
/?yS, y8a, Sa/1, afiy. 


This follows from Example 3. It is Lagrange’s inter- 
polation formula for a cubic polynomial, and it applies in 
general, with the necessary changes, to the n-ic. 


Case (ii). 
gives 


If cf>(x) = (x— a) r (x— jS) a ... the general rule 


<f>(x) 




A 

(x—a) r 


where A is a polynomial of degree less than r. 

Put x—a = y and express A by Horner’s method as 
a polynomial Pi+P 2 y+P 3 y 2 + --- of degree less than r. 
The partial fraction due to x—a is then (Pi+P22/ + *--)/2/ r > 
which separates at once into the terms 


X'l _i I rr 

(x—a) r (x—a) r ~ x x— a’ 

where the numerators are constants. At most there are 
n such partial fractions : in particular cases some, after 
the first, may vanish. Similarly, here are at most s terms 
due to and so on. 

In practice the calculation of the numerators is trouble- 
some whenever r> 2 ; but p t is obtained at once on 
multiplying throughout by (x— a) r and then putting 
x = a. If r = 2, multiply throughout by (x—a) r , 
differentiate with respect to x and then put x = a. This 
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gives p 2 . In higher cases multiplication and further 
differentiation yield the successive numerators of the 
partial fractions in question, but probably Horner’s 
method is more rapid than this. In some cases equating 
of coefficients gives the results quite rapidly. 


Examples. 1. 

x 3 p q r 8 

— —L 1 } . 

(x-t-2) 2 (x 2 —l) (x~\~2) 2 x + 2 x-1 x + l 

Hence — ■ = p-\-(x+2)q + {x + 2) 2 [...']. 


First put x = — 2 ; then —8/3 — p. Next differentiate: 
the third term on the right then becomes 

2(a + 2)[...] + (* + 2 )*[...]', 


where the brackets indicate fractions with #±1 in the 
denominators to first or second powers. On putting x = — 2 
these denominators do not vanish. Hence the third term 
(# + 2) 2 [...] vanishes after differentiation without tho need 
for calculating explicitly tho expression in the bracket. Thus 


3x a 

x 2 -l 


x z . 2x 
(x 2 — \) 2 


— <? + (# + 


giving q = 4/9. Also r = 1/18, s — J. 

Alternatively, we might form relations between the un- 
known coefficients by substituting other special values of x. 
Here x — 0 gives \p + \q — r+s = 0, so that when the values 
of p, r and 8 are found as before we obtain q — 4/9. 

z 3 A 1 , 1 

2 ' (x+2) t {x 2 -l) ~ (z+2) 1 + 162(a? — 1 ) + 2(a;+l)’ 

where A is a cubic in x and the other fractions have been 
evaluated by the ordinary method. Put x + 2 = y , 
A — Px+P22/+P82/ a +^42/ 3 » an d multiply throughout by the 
denominator involving x. Thus 

fy~l y-31 

(y - 2) 3 =|>1 +p 2 y +p 3 y* +P l y 3 ]{y i -ly+3) +y* I + ~2~ ' 
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Now pick out coefficients of y. We find^ = —8/3, p 2 = 4/9, 
p 3 = —14/27, p i = —41/81 most easily from the coefficients 
of 2 /°, y 1 , y 4 , y 5 , and can use y 2 , y z for checking the result. 

Case (iii). Quadratic partial fractions are 

retained if we wish to avoid complex numbers in the 
case when p 2 <4g. Here there are two real undetermined 
coefficients in the numerator. Paradoxically the values 
of a and b are readily found by resorting to complex roots 
and proceeding as in (i). Multiply throughout by x—a—ift 
where is a complex root of x 2 -\-px\-q = 0 and then 

put x = a+i/?. 

a 3 A ax-\-b 

xamp e. ( a ._pij3( a .2_|_4 a ._|_5) (x-fl) 3 x 2 -\-±x + 5 

Here A is a quadratic in x. The complex factors of a; 2 +4a;-f 5 
are (x-\-2 +i)(#+2 — i). Multiply by x+2 and put 
x = —2 —i. Then 

(— 2 — i) z a(— 2 — i)-\-b 

( — 1— ^) 3 (— 2i) = ^2 i # 

This reduces, sinco i 2 — — 1, to 9/4 — 1 3i/4 = b—2a—au 
Since a and b are real, we have b— 2a = 9/4, a == 13/4, 

13 9 7 

and so 6 = 35/4. Also A = — x 2 — - x— -. 

4 2 4 

A 

Case (iv). The real partial fraction — — may 

(x 2 +px+q) r 

occur, where p 2 < 4g. Since the denominator is of degree 2r, 
that of A may be 2r— 1 or less. On dividing A repeatedly 
by X = x 2 +px-\-q y let the remainders be R ly R 2y .... Then 
the fraction can be written 

Rr . ^r-1 , I ^1 

X r Z r_1 ’”~ t ~ X ’ 

Such remainders R are linear and real. Hence the original 
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rational function has been reduced to a sum of terms of 
the following possible types : 

a a ax-\-b ax-\-b 

^ X ^ x—a * (x—a) ri x 2 +px+q’ ( x 2 +px-{-q ) r ’ 

If the coefficients in the original function f(x)/<f>(x) are 
complex, cases (iii) and (iv) are unimportant. They are 
significant when we wish to express a real function in its 
simplest partial fractions. 

The following examples suggest further aspects of the 
subject. 

I 2x _ l 1 _ 2 2 

x 1 +4 x-\-2i x — 2 i 4x 2 — 3 2x + V3 2x— V3 

_ 5 —x ax-\-b c „ , . 

2. ; — — = — — - -\ , find the constants <z, 6, c. 

(a: — 1 ) (a; 2 — 3 ) x 2 -3 x-l 

This is the form to use if we wish to avoid irrational real 
numbers. Multiply by x — 1 and put x = 1 ; thon c = — 2. 
Multiply by x— y/3 and then put x = \/3. Hence 

5—V3 aV3-\-b _ _ _ 

<V3-1)2V3 2VT’ 80 that 6 - ^3 = ( V3 - l)(a V3 +6), 

or 5-f6 — 3o = V3(l -f&— a). From x-\~\/3 we get the 
same result but with — V 3 for -f- V 3. Hence each side of 
this equation vanishes separately and we have a = 2, b — 1. 

3. If f{x) is of lower degree than <f>(x) — (x — a)(x — fi)(x — y), 
where a, j3, y are distinct, prove that 

/W 

m 

4. If a — prove the result corresponding to the 

above, where the second column of each determinant is 
replaced by the derivative of the first column with respect 

to a; that is, these columns become i- , 1, OV and 

Ida r — a J 

{2a, 1, 0} respectively. 


/<«) /( P) f(y) 


a 2 

P* 

y 2 

x — a x — ft x — y 


a 

P 

y 

a p y 

1 1 1 


1 

i 

l 
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20. Determinantal Form of Partial Fractions. 

If f(x) is a polynomial of degree less than that of <f>(x)> 
which is supposed to have no repeated factors, we can 
resolve into the form 


/(*) 

<j,{x) 


1 

1 

.. 1 

a 

P • 

.. A 

a 2 

ft 2 • 

.. A 2 

a n ~ 

-2 fin- 2 

.. A" -2 

A a 

)M • 

-/(A) 

X — 

a x—fi . 

.. x—\ 


1 

1 

.. 1 

a 

P • 

.. A 

a 2 

P 2 • 

.. A 2 

a "- 2 

/5 n-2 .. 

..A"- 2 

a "- 1 

/ S "- 1 .. 

.. A"- 1 


For the denominator determinant, A say, is the n- 
rowed alternant (Ait ken, Determinants and Matrices , p. 41) 
which has the \n{n— 1) linear factors such as j8— a. In fact 


A 


= J(aj3.../cA) = (/?— a)(y— a). ..(A— a) 


. . (2) 


x(A— k). 


Also the numerator determinant may be expanded 
according to its final row as 


2 


Af( a) 

x—a 


( 3 ) 


the summation being of n terms corresponding to a, ft, 
..., A, where A is an alternant of n— 1 letters. Hence the 
right-hand side of (1) possesses the typical term 

A(aP...K)f(X) m (4) 

A(ap... K \)(x-X) (A— a)(A— /J)...(A— /c)(a;— A) v ' 

which is precisely the term given by the last partial 
fraction of f(x)l<j)(x). Symmetry shows that all the partial 
fractions are obtained in this way. 
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21. The Confluent Case of Partial Fractions. Let 

H*) = (x-ay(z-py... 9 . . (i) 

where a, j8, ... are all distinct. We call this the confluent 
case, where r of the roots a may be regarded as originally 
distinct but tending to equality, s further roots /} tending 
to equality, and so on. If in the above determinants 
two roots are made equal the quotient is indeterminate, 
but if in the second column of each determinant a is 
replaced by a-\-h and h is allowed to tend to zero, the 
quotient of the determinants may be evaluated. When 
the first r roots are confluent and equal to a we take 

du 1 d 2 u 1 d T ~ x u o 

M ’ da 2! So 5 ’ (r— 1)! da'- 1 ’ (2) 


to be the first r elements of any row, in numerator and 
denominator determinants, where u denotes the original 
entry in the first column on the row in question ; that is, 
we differentiate each column successively and divide by 
a suitable factorial as in Taylor’s expansion. The next 
s columns are formed in the same way afresh from a new 
root j8 ; and so on. For example, if f(x) is a cubic or 
lower polynomial, 

f(x) 

(x-a) z (x-p) = 


1 


a 1 

a 2 2a 1 

/(a) //(a) V W/WV 

x — ala; — aj 2\\x — a! 


1 

P 

P 2 

m 

x-p\ 



1 . 

. 1 


a 1 

P 


a 2 2 a 1 /S 2 

a 3 3a 2 3a 0 3 


(3) 


where accents denote differentiation with respect to a. 

Proof. Proceed by induction on n, the degree of the 
denominator (f>(x ), and arrange the cases for the same 
value of n in the lexicographical order of the partitions 
{ rs...t} f where r +$+...+£ = n. The ordinary case is 
{11... 1}, and all other cases are confluent. For n = 4 
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the order is therefore {1111}, {211}, {22}, {31}, {4}. Any 
confluent case {...p} or {...pi...}, where p is the last index 
exceeding unity, is derivable from an earlier case {. . . (p — 1)1} 

or { — (2? — 1)11 } by letting the index p — 1 merge with 

the next succeeding unit index. 

Now assume the truth of the theorem for this earlier 
case, letting (x—y) v ~ 1 (x— S) be the factors in <j>(x) answering 
to the indices p — 1, 1 which become confluent when 
8 — y+h and k-+0. The final entry of the column 
answering to x—S in the upper determinant will then be 


f(y) 

X—y 


+h 


/( S) = f(y+h) 

X — 8 X — y—h 



h 

(p-1)! 



(P-1) 

+ .... 


Here every term before the p th in the series will disappear, 
by subtracting a suitable multiple of one of the p — 1 
columns just preceding — for similar series on each row of 
either determinant. So these may be discarded. After 
cancelling the factor h 3,-1 from both determinants and 
then letting h-> 0, the required result follows. 


Example. Replace the third columns in (3) above by 
(1, y, y 2 , f(y)/(x — y)} and (1, y, y a , y 8 }, and the left-hand 
expression by f(x)/(x — a) 2 (x — f$)(x— y). Assuming the truth 
of this identity, let y = a -\-h. The third columns are now 


1 


t t-\~h 


a ~\~h 
(a+h) 2 
du h 2 d 2 u 

el + 2i + 


and 


1 

a -\-h 
(a+h) 2 

(*+h)\ 


where u — f(a)l(x — a). The operation col 8 — cob— /icol 2 , 
followed by cancelling h 2 and then putting h = 0, yields at 
once the result desired, corresponding to Case (ii) of 19, p. 41. 

Corollary, We append a proof, by a similar induction, 
of the theorem on the value of the confluent alternant : 


A( a r P'y» ... A<) = I1(P -a)”, 

D 
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where the index rs is the product of the corresponding 
indices in A . 

Proof . It is understood that r +«+...+$ = n, that 
a, jS, A are distinct, and that the blocks of confluent 
columns are arranged as in the denominator determinant 
above. Let _p(>l) be the final confluent index. 

Assuming the truth of the theorem for the partition 
{...(p — 1)1...} we have, say, 

J(...y p_1 S...) = (8— y) p_1 (y— a) r(p ~ 1) (8— a) r i/f, 

where the product index p — 1 is obtained from the indices 
of y and 8 in J, and ip denotes all remaining factors. Let 
8 = y-f-A, so that the right-hand expression becomes 

h»-\y — a) r(p-1) (y -\-h —a) r \p. 

Divide by h v ~ l as before and let 7&->0. The left becomes 
the required new confluent determinant, and the right 
is a product of factors such as 

(y — a) rp ~ r+r = (y- a) fp 

which involve y with a or any other root distinct from y, 
together with factors such as (/3— a) r * which are unchanged 
throughout the process. 

Examples. 1. In the denominator of (3) above, the 
value is 

(0-a) 3 = A(aaafl) = d(a 3 £). 

On the other hand, A(a 2 f$y) = (y — a) 2 (/? — a) 2 (y — j9). 

2. d(aW) = (jS — a) 12 (y — a) 8 (y ^)®. 

3. Integrate with respect to x: f(x)/(x—a)(x—p)(x—y), 
f(x)l(x-a) 2 (x--p),f(x)/(x—a)*. (Replace f(a)/{x—a) through- 
out by /(a)log(a;— a) in the determinantal formulae equivalent 
to the sum of partial fractions.) 

22. The Expansion of a Rational Function. Partial 
fractions give a means of expanding a function of x in 
ascending or descending powers of x. We shall illustrate 
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this for the simple case when all the factors of cf>(x) are 
distinct. Let 

/(*) = « . b , . I 

<f>(x) x—a'x—p *’’~x— A* 

Now a/(x— a) = a(^ -1 +ao;~ 2 +a 2 ^ _3 +a 3 ^ _4 +...), . (1) 

provided that this series converges, which happens if 
|x|>|a|. This means that x is numerically greater than 
a if both are real, and that the modulus of x exceeds that 
of a if both are complex. For a proof we may refer to 
any text on analysis. We may regard the statement 
just written as a geometrical progression summed to 
infinity or equally well as an example of the binomial 
expansion of ax~ 1 ( 1— ax -1 ) - * 1 . 

Now take \x\ greater than the greatest among |a|, 
|j8|, |A|, expand each fraction, and add the results. 

Then 

R(x ) =zf{x)!<f>{x) = (Sa)x- 1 -\-{Saa)x- 2 -\-{Eaa 2 )x-^-\-..., (2) 

and this is the development of R{x) in descending powers of 
x, valid for sufficiently large values of \x\. 

Similarly, when |x|< |a| we have 

al(x—a) = — a(a -1 +ci~ 2 x-|-a~ 3 x 2 + ...). . (3) 

Hence for sufficiently small values of x, in fact when \x\ is 
less than each of |a|, |j8|, ..., |A|, we have the corresponding 
expansion in ascending powers of x : 

R(x) = f{x)j<f>(x) = —2Jaa~ l — (2Jaa~ 2 )x—(Uaa~ s )x 2 — ... . (4) 

Unless one or other of a, /?, . .., A vanishes this expansion 
is possible. Therefore if x is not a factor of <j){x) we may 
expand f{x)l<f>(x) in ascending or descending series of powers 
of x, for suitably small or large values of x. The same 
applies when <j>(x) has repeated roots, where the work 
may be carried out on terms such as a(x—a)~ r by the 
binomial theorem. If, however, x occurs as a factor in 
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<j)(x), we remove the x before making an ascending expan- 
sion, and afterwards replace it. 

Example - gip-i) = i {(2— *)(i — 

= x~ z (2 — x-\-2x 2 — x 3 + ---) 

= 2x~ z —x~ 2 -\-2x~ 1 — \-{-2x — ... . 

Such a series, involving both ascending and descending 
powers of x , is called a Laurent expansion. Here there 
is necessarily a finite number of negative powers and an 
infinite number of positive powers. 

For a descending expansion there is no restriction on 
the denominator. In this example it is 

x-*-2x~*+x ~ 6 -.... 

It is very useful to notice that the leading term in 
either expansion is given by the dominating terms of 
f(x) and <f>(x). In ascending series the dominating term 
is the lowest in powers of a;: in descending it is the 
highest. 

Exampl03 - E ^-£ + 2 = ££ + SZ2* 


where a = — 1 , b — 2, and this on expansion gives 

— (a; -1 -f x ~ 2 -\-x~ z + ...)+ 2(a; _1 -f- 2x~ 2 -f 4a; -3 + . . . ) 

= x- 1 + 3x- 2 + 7a;- 3 + ...+(2 n -l)a;- w + ..., 

if x>2 or < — 2 when real ; or if |#|>2 when x is complex. 
Again 


1 

1 —x 




= (l+a;+a; 2 -f ...)— + 

= \x -f- 1^ 2 + ••• + ( 1 — 2~ n )a; n + • • • > 
if —%<x< J when x is real ; or if |#|<£ when x is complex. 


2. Expand x A /(x z — l) in ascending and also in descending 
powers of x. Is a Laurent expansion possible here ? 
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23. Recurring Series. When x is not a factor of 
c j)(x ), any rational function R(x) may be expanded as above 
in powers of x. Suppose, therefore, that 

f(x)l<f>(x) = p 0 +p 1 x+p 2 x 2 +...+p n x n +... . 

The sequence of coefficients p 0 , p v p 2 , ... is then called a 
recurring sequence or series. Such a series is characterized 
by a scale of relation, which means that, after a certain 
value n 0 of n, each coefficient p n is formed in exactly the 
same way by a linear combination of its r immediate 
predecessors. 

A scale of order r = 1 is given by p n = ap n _ v where 
a is constant and n = 1, 2, 3, ... in succession. A scale 
of order 2 is given by p n = ap n _ l -\~bp n _ 2 , where both 
a and b are constant : and so on. It is easy to prove that 
if (f>(x ) is of degree r the scale is of order r. The proof 
is left as an exercise for the reader, but is here illustrated 
when r = 2. 

Let f(x)l<f>(x) = q(x)+r(x)l<f>(x) 

in the usual way : then q(x) is a polynomial which may 
interfere with the recurrence law for the first few terms, 
but r(x)/<f)(x) gives the recurring series proper. We have, 
say, when x is suitably small, 

r(x)!f)(x) = (cx-\-d)/(x 2 +gx+h) 

= u Q -\-u 1 x-\-u 2 x 2 +u z x*+..., 

where we assume that 0 since <f>( 0)^0. Multiply 
throughout by x 2 -\-gx-\-h and equate corresponding 
coefficients of x°, x 1 , x 2 , ... on both sides of the resulting 
identity. Thus 

d = u 0 h, 0 == u 2 h+u x g+u z , 

c^Utl h+u 0 g, 0 = u 3 h+u 2 g+u v 

0 = u n h~\~u n _i g+u n _ 2 , n> 1. 

Since h=£0 this gives a recurrence relation for u n in terms 
of u n _ l and u n _ 2 , where n = 2, 3, 4, .... 

Conversely, we may sum a given recurring series to n 
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terms or to infinity (i) if the scale of relation is given, 
or (ii) if sufficient terms are given so as to reveal a scale 
unambiguously. 

(i) Let s = u 0 -\-u 1 x-\-u 2 x 2 + .».+u n x n -{ where u 0 , u x 
are given, and also u n = au n _ 1 +bu n _ 2 for n = 2, 3, ... . 
Multiply throughout by ax and also by bx 2 . Thus 

axs = aiiQX+au^ 2 +au n _ x x n 
bx 2 s = bu Q x 2 +bu n _ 2 x n -\ ... . 

Hence by subtraction, carried out by powers of x , 

(1— ax— bx 2 )s = au 0 )x, 

since every further coefficient of a power of x disappears 
owing to the scale of relation. Thus 

__ Uq + aitf^x 

l—ax—bx 2 

which is a rational proper fraction with a quadratic 
denominator (r = 2). To sum the series to n terms we 
proceed in the same way, but retain the non-vanishing 
terms involving x n+1 and x n+2 . 

The procedure is palpably a generalization of the well- 
known method of summing a geometrical progression : 
indeed the latter is the case of a recurring series for which 
r = 1. 

(ii) Given r = 2 and the series 1 +3# +7a; 2 + 15a; 3 

to find the general term, the scale of relation, and the 
rational function s. 

Assume u n = au n _ x -\-bu n ~ 2 ; then 7 = 3 a-\-b, 15 = la 
+36. Hence a — 3, 6 = —2. Now proceed as before : 

(1— 3a;+2a; 2 )s = l+(3-3)x = 1. 

Thus 

1 1_ 2 

8 1— Sx-\-2x 2 l—x^l—2x 

= l+3x+lx 2 +...+(2 n+1 -l)x n +... . 

The general term is here shown. 
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Examples. The reader should take simple rational 
functions such as (2 — 3#)/(l — 5# + 6# 2 ), should expand them 
to several terms in ascending powers of # (using either partial 
fractions or ordinary division), and should then try to recover 
the rational function from the recurring series by the methods 
outlined in the present section. 

1. Determine the scale of relation and the coefficient of the 
n th term and the sum to infinity of the recurring series 

4-8#+28# 2 -80# 3 +... . 

For what values of x can it be summed to infinity ? 

2. Sum to infinity 2+5#+ 13# 2 + 35# 3 + ... • 

3. Discuss the series l + 3#+6# 2 + 10# 3 + 15# 4 +21# 6 +... . 

4. Show that 

1+# cos 0+# 2 cos 20+# 3 cos 30+... 

is a recurring series. So too is the corresponding series 
with each cosine replaced by a sine. 

|> n — 2w n _! cos 0+*V- 2 = 0 ]. 

5. If |#|<1 the sum to infinity of the above cosine 
series is 

(1—# cos 6) /( 1 — 2# cos 0+# 2 ). 

6. Discuss the hyperbolic cosino and sine series corre- 
sponding to (4). 

[Answers 1. 1 — ( — 3) n , 4/(1 +2#— 3# 2 ), |#|<£. 

2. (2 — 5#) / (1— 5#+6# 2 ). 3. r = 3 ; (l-*)-*.] 
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THE FUNDAMENTAL THEOREM OF ALGEBRA 

24. Statement of the Theorem. The basic theorem 
regarding algebraic equations may be stated as follows : 
Every equation 

f(z) = z n +a 1 z n - 1 +a 2 z n ~ 2 +...+a n = 0 

in which the coefficients are arbitrary, real or complex 
numbers has at least one root z = a+ib, where a and b 
are real. 

The first satisfactory proof of this theorem was given 
by Gauss (Werke, iii. 1) in his dissertation : Demonstratio 
nova theorematis omnem functionem algebraicam unius 
variabilis in factores reales primi vel secundi gradus resolvi 
posse (Helmsted, 1799). Gauss incidentally criticized the 
earlier defective proofs put forward by D’Alembert, 
Euler, and Lagrange. Gauss gave two further proofs 
in 1815 and 1816, but returned to his original method 
in 1849. A new proof was given by Cauchy (Cours 
d > analyse algtbrique , ch. x, 1821), and later by Sturm 
(Journ. de Mathimatique , i. 1836). 

Although this theorem is fundamental for algebra, its 
proof belongs to the theory of analytic functions, which 
is a branch of analysis. The simplest proof runs as 
follows: f(z) vanishes if 1 jf(z) is infinite. Now 1 /f(z) is 
an analytic function of z and, according to a theorem of 
Liouville, must either be a constant or else become infinite 
at one or more values of z. Hence f(z) vanishes at least 
once. Liouville’s theorem allows this value of z to be 
either infinite or finite. Now /(z)->oo if z->oo ; hence the 
so 
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value in question can only be finite. This proves the 
theorem. 

A direct appeal to contour integration may be made 
which obviates the use of Liouville's theorem. The 
reader will find the matter discussed in textbooks on 
analysis ; for example, Mac Robert, Functions of a Complex 
Variable (1917), pp. 67-69, Phillips, Functions of a Complex 
Variable (1940), p. 113. 

The original method of Gauss may be sketched as 
follows : Let z = x-{~iy 

and f(z) = f(x+iy) = u(x , y)+iv(x, y), 

where the polynomial f(z) has been separated into two 
parts, real and imaginary. For instance, 

z 3 +2z+3 = (x+iy)*+2(x+iy)+3. 

Here u = x 3 — 3xy 2 +2x+3, v = 3x 2 y—y 3 -\-2y. 

Both functions u and v are polynomials in x and y whose 
coefficients are real. Now suppose the curves u = 0, 
v = 0 to be drawn, for Cartesian coordinates x and y. 
If we can show that these curves intersect at a real finite 
point P = ( x f y), then at this point both the polynomials 
u and v vanish simultaneously. Hence f(z) vanishes, 
and the point P represents a root z = x-\-iy of the original 
equation /(z) = 0. 

Now let x — r cos 0, y = r sin 9. Then 
z = r(cos sin 9 ), 

and by Demoivre’s theorem z w = r n (cos n9-\-i sin n9). 
Hence 

u = r n cos n9+r n ~ 1 {. v = r n sin 7i0+r n_1 (...) + ... . 

The polar coordinate forms of the curves may therefore 
be written 

cos n9-\-r'~ 1 X = 0, sin n9-\-r^ 1 Y = 0, 
where X and Y are terms in cosines and sines of multiples 
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of 0, with powers of r in the denominators. If r is large 
we obtain cos nd = 0, sin nd = 0 as approximations to 
the curves, that is to say, radii from the origin at equal 
angles 27r/n starting from 9 = \tt for the cosine, and 
9 — 0 for the sine. If with centre at the origin 0 a circle 
r = R is drawn with a large radius R, and a regular polygon 
ABC... is inscribed, with 4 n vertices beginning at 9 = 0, 
the above results mean that, as R-^co, the curves u = 0, 
v = 0 must approximate to the alternate radii OB , OD , . . . 



for u f and OA , 0(7, ... for v. An asymptote of the u curve 
would be parallel to OB and at a finite distance from OB. 
Thus it would cross the circumference ABC at a point Q 
such that the arc QB would remain finite while QA would 
tend to infinity, as OB = R->co. Similarly for each vertex. 
Hence when R is large enough the u curve runs out of 
the circle near the n points B, D, ..., while the v curve 
runs out near the alternate points A, C, ... . 

Within the circle the curves are continuous, so that 
the 2 n points near B, D , ... on the u curve must link up in 
pairs by means of n arcs, however complicated or twisted. 
Similarly for the v curve. It is topologically impossible 
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to draw such arcs, starting at alternate circumferential 
points for u and v , without at least one crossing of a u 
arc with a v arc. Such a crossing gives the required 
point P and proves the theorem. 

The above is a sketch of the proof. It obviously 
involves a knowledge of the algebraic curve, given by a 
real polynomial in x and y, and particularly the property 
that each arc of such a curve is continuous. The proof 
depends ultimately on the property that if Q , R f S, T are 
four points in order on a circle, a continuous path from 
Q to S within the circle must cross a continuous path from 
R to T. 

25. The Product Form of an Algebraic Equation. 

Let us now return to the notation f(x) for a polynomial, 
where x may be real or complex. The fundamental 
theorem has shown that a root a of the n th degree equation 

f(x) = a 0 x n +a 1 x n - l +a 2 x n ~ 2 -t-...-i-a n _ 1 x+a n = 0 

always exists ; that is, /(a) = 0. Now perform the 
division f(x) — (x—a). We may write as before 

f(x) = q(x)(x-a)+r(x). 

Since the divisor is linear the remainder is either a constant 
or zero. Thus r(x) — c, where c is a constant which 
may be zero. Now put x = a in the identity, which 
becomes /(a) = c. Hence c vanishes with /(a), so that 
x—a is a factor of f(x), whenever a is a root of f(x) — 0. 
From the first step of the actual division we notice that 
q(x) is a polynomial with a ( pc n ~‘ 1 for leading term. Hence 
f(x) takes the form 

f{x) = (x —a) (atfc? 1 - 1 -\-b x x n - 2 + . . . +6 n _i) . 

(Actually b x = a 0 a+a v b 2 = a 0 a 2 -j-a 1 a+a 2i etc.) By the 
fundamental theorem, q(x) == 0 also must have a root /}, 
such that x— j3 is a factor of q(x). Hence 

f(x) = (x-a)(x-p)(a ( ? n - 2 +c l x n -*+...+c n _ 2 ). 
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The process may be repeated on the quotient until the 
final quotient is a constant, a 0 in fact. This happens 
when n such linear factors have been segregated. Thus 
we shall have 

f(x) = ct 0 (x-a)(x—l 3)...(x-X), a 0 ^0, 

where the n numbers a, j9, A are called the n roots 
of the equation f(x) = 0. We have reduced f(x) to its 
product form, or have resolved the polynomial into n 
linear factors. 

Example. f(x) = 2x 3 — 5x 2 — 9x -f 18. 

Here /( 3) = 0, so that x — 3 is a factor. We have, by division 
or by Horner’s method, 

f(x) = (2^ 2 -|-a;-6)(^-3) 

= (2jj — 3)(a? + 2)(a? — 3) 

= -^(* + 2 )^- 3 ) 

3 

where the roots are —2, 3. We note that the leading 
coefficient, a 0 = 2, survives as a factor in the final resolution. 

26. Repeated Factors. Equal Roots. It may happen 
that some of the numbers a, /?,..., A are equal. If exactly 
r of them are equal to a then we say that a is a root repeated 
r times, or is an r-fold root. Clearly (x— a) r is then a factor 
of f(x). Allowing for such repetitions we have now 
expressed f(x) as 

f(x) = a 0 (x-a) r (x-P)'...{x- X)*, 

where r +$+...+£ = n. Hence the indices r, s y ..., I 
form a partition of n, and the simple or unrepeated case 
corresponds to the partition {11... 1}. Obviously f(x) 
may have either one, two, ... or n distinct linear factors : 
it cannot have more, for this would imply a leading term 
a 0 x m in f(x) of degree exceeding n. 
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Again, if 9 is any number distinct from a, /?, A then 
each of 9— a, 6— j8, ... is non-zero. Hence 

f(0) = a 0 (9—a) r (9 — P) t ...(9—-X) t ^0 y 
so that f(9 )^ 0 and 9 is not a root of the equation. We 
have found n roots (allowing for repetitions among them) : 
we see that no other number is a root. Hence there are 
exactly n roots of the equation. 

This completes the result of 13, where it was proved 
by rational methods that the equation f(x) = 0 could not 
have more than n roots (p. 23). Apart from the order in 
which the factors occur the resolution of f(x) into its factors 
is unique. 

The following theorem concerning an equation with 
repeated roots is important : When the n th degree equation 
f(x) = 0 has exactly r roots equal to a then f(x) and its first , 
second , ..., (r— l) th derivatives vanish at x = a, but its 
r th and following , up to the n th inclusive , do not. 

Proof. In this case we can write f(x) = (x—a) r i/j(x) 
where i/j(a) ^0. Hence, by differentiation, 

f\x) = r{x—a) r - 1 ifj{x)+(x~a) r ifj'{x) y 

so that f(x) contains the factor (x— a) r_1 . Its cofactor 
is rifj(x) J r(x—a)ifj'(x), and, when x — a, this is equal to 
n/r(a) which is non-zero by hypothesis. Thus the first 
derivative contains the factor x—a exactly r— 1 times. 
The same method shows that f"(x) contains the factor 
r — 2 times : and so on until the (r — 1)* A derivative 
contains the factor once, and the r th is free from the 
factor. Thus the theorem is proved, yf 

Example. f(x) = x 3 — 4x 2 + 5x—2, 

f'(x) — 3x 2 — 8a; + 5, f*(x) — 6x— 8. 

Here f(x) = (x-l)*(x-2)J'(x) = (a?-l)(3*-5), and /(l) = 0, 
f'(l) = 0,/'(l) ^ 0. The root 1 is repeated twice. 

We may note that if, as here, f'(x) can readily be factorized, 
it is easy to find the repeated roots. The second zero 5/3 
of f'(x) is of course not a root of f(x) : nor is the single zero 
/'(*). 
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The geometrical interpretation of a repeated root is 
simple. If /'(a) = 0 the gradient of the graph is zero at 
x = a, and if also /(a) = 0 the graph meets the axis of 
x at the point (a, 0). Hence if a is a repeated root of 
the equation f(x) = 0 the graph touches the axis of x 
at (a, 0). It is worth while drawing the graphs of /, /', 
and f” in order to gain insight into the behaviour of / 
at a repeated zero. Furthermore, by Taylor’s theorem, 
we have 

/(*) — f (a -\-x — a) =/(a)+(x-a)/'(a)+...+ ^=^- n / <n, (a). 

If /(a) and its first r— 1 derivatives all vanish but not the 
r th , then (x— a) r is a factor of every term in this series 
and therefore of f(x). Hence the converse of the above 
theorem is true. 

An important corollary of the theorem, having reference 
to the G.C.M. of f(x) and f'(x), is as follows : If f(x) contains 
the factor x— a exactly r times , then G(x), the G.C.M. of 
f(x) and f'(x), contains the same factor exactly r— 1 times , 
and G(x) is composed entirely of such repeated factors of f(x). 

Proof. If /(a) and /'(a) both vanish then x—a is a 
common factor of f(x) and f'(x). Since it occurs to degree r 
in /(a), it occurs to degree r — 1 i nf'(x), so that G(x) contains 
this factor to the power r — 1 also. 

Now G(x) cannot contain a factor x—d unless both 
f(0) and f'(0) vanish. Hence G(x) can only be 

(x—a) r ~ l (x— jS)* - * 1 . . . (x— 
where a, jS, ..., A all differ. 

Again, if f(x) is any given polynomial, not necessarily 
factorized, we can find both f'(x) and G(x) by the ordinary 
rational methods. Also we can write 

f(x ) = G(x) h(x) 

where h(x) y the cofactor of G(x) in f(x), can be ascertained 
by long division. Every zero of h(x) is manifestly a zero 



REDUCED EQUATION 


63 


of f(x), but it is more interesting to note that, conversely, 
every zero of f(x) is also a zero of h(x) and that h(x) has no 
repeated factor. Indeed 

= M = a ( x -a) r (x-l3)*...(x-\y 
G(x) 0 (x—a) r - 1 (x—fi) a - 1 ...(x—A)*- 1 

= a 0 (x—a)(x—j 3)...(x—\). 

The equation h(x) = 0 is called the reduced unrepeated 
form of f(x) = 0. We note that if any index t is unity 
G(x) has no corresponding factor. 

In practice all repeated factors may be removed by 
this method, even if the separate factors x—a cannot 
easily be identified. If they can, so much the better. 

Examples. 1. Solve 

x 7 — a: 8 — 1 lx 5 + 1 la: 4 -}- 19a; 3 — 19a: 2 — 9a:+9 = 0. 

(The roots are 1, 1, 1, —1, —1, 3, —3.) 

2. The reduced unrepeated form of 

x 7 — a, 8 -f 2a: 5 + 2a: 4 — 3a: 3 + 3a: 2 — 4 

is (a: 8 -fa:-|-2)(a: — 1). There are three pairs of repeated roots 
and one unrepeated. 

27. Complex Roots of an Equation. Starting with 
the known fact that a quadratic with real coefficients may 
fail to have real roots, we can easily construct an equation 
of the n th degree with less than n real roots : thus 
{x 2 ~\-\){x 2 +x+l) = 0 is a quartic equation with no 
real roots ; and (x— l)(a; 2 + l) 2 = 0 is a quintic with only 
one real root. We must therefore be prepared to consider 
complex roots of a real equation 

f(x) = a Q x n +a 1 x n ~ 1 +...-) r a n = 0, 

where each coefficient a, is real. Incidentally, we may 
merge the case of complex equations, where the a { are 
complex, in that of the present case where the coefficients 
are real, by the method of 5, Ex. 2, p. 11. The roots of a 
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complex equation of degree m may be regarded as roots of 
a real equation of, at most, degree 2 m. 

Example, x 2 -— ix — l— i = 0 is a complex quadratic whose 
roots are included among the three roots of the real cubic 
x*-x 2 + 2 = 0. 

Let us then confine ourselves to real equations. If 
a+ij8 is any complex number we have 

/(a+i£) = A+iB 

where a , ft, A and B are real. This result is obtained by 
a straightforward reduction due to replacing i 2 by — 1 
whenever it occurs. Since (— i) 2 = — 1 we should obtain 
in exactly the same way 

/(a-ijS) = A-iB 
where A and B are the same as before. 

Example. 

d{a -\-ifi) 2 -f-6(a -f-^/3) -|-c = d{a 2 — f$ 2 ) -\-ba -f-C -|-'i(2(2aj3 
a(a— ip) 2 -\-b(a— ip)-\-c = o(a 2 — p 2 )-\~ba~\-c— i(2aap-{- bj3). 

Now suppose that a-H/? is a root of f(x) — 0, so that 
A+iB = 0. Since A and B are real, this is only possible 
when ^4 = 0, B = 0. Hence A —iB — 0, so that 
f(a—ifl) = 0, that is, a—-i/3 is also a root. Hence we 
have proved the following property : 

AU complex roots of a real equation occur in pairs , 
such as which are conjugate complex numbers: the 

number of complex roots of a real equation is therefore even . 

Since (x—a— ifi)(x— a+i/?) = (x— a) 2 +j3 2 , which is real 
and positive whenever x is real, it follows that every real 
polynomial may be expressed in the form 

f(x) = a 0 (x-a)...(x 2 +px+q)... 

= a 0 U(x-~ a)U(x 2 -\-px-\-q) y 

where n indicates a product of typical factors, and in 
each quadratic factor p 2 <4q. This is a resolution into 
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real factors, linear for each root a of f(x) = 0, and quadratic 
for each pair of conjugate complex roots. Of course 
there may be repetitions among the roots, but if a complex 
root is repeated, so is its conjugate, and so therefore is 
the real quadratic factor x 2 -\-px-\-q. 

In terms of real numbers we can now say that every 
real polynomial can be factorized into real linear or 
irresoluble quadratic factors, and that such a factorization 
is unique. 

Examples. 1. Find the real factors of a; 4 + 1 Ox 2 -f 169, 
a; 6 -fl, x 4 — 16# 2 -f* 40#— 25. 

2. Solve the equation 27x i —45x 2 -j-26x — 4 — 0, given that 
it has a repeated root. 

3. Find the reduced unrepeated form of the following 
equations : 

(i) x 4 — \ 8a; 3 -f- 69x 2 -f- 1 08# -J- 36 = o. 

(ii) # 4 — - 1 O# 3 + 3 7x 2 — 60# -f 36 = 0. 

(Each equation has two pairs of repeated roots.) 

4. Solve the equation x 4 — 8a: 3 -f 16a; 2 — 64# -f 64 = 0, which 
has two roots of the type ±?j8. 

5. The equation x 4 —4x 3 ~j-7x 2 -j-6 =0 has no real roots. 

(Consider (x— l) 4 -f- (#-f-2) 2 -fl.) 

6. Find four real roots of a 8 — 47x 4 -fl = 0. 

7. Solve (x— l) 7 = x 7 — 1, which has two repeated roots. 

[Answers : 

2. Pat 3x=y ; x=$, — i(l±A/13). 

4. ±2y/2i, 4±2 v / 2. 

6. ±i(3±V 5 )-] 
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PROPERTIES OF THE COEFFICIENTS OF AN 
ALGEBRAIC EQUATION 

28. Tlie Elementary Symmetric Functions. Let a, 

A be the n roots of the equation 

f(x) = a 0 a: n +a 1 a; n - 1 +... +a n _ 1 a;+a n = 0, a 0 ^0. (1) 

Then f(x) — a 0 (x—a)(x—fi)...(x— A). . . (2) 

Equating these two expressions we have an identity which 
is true for all values of x ; hence we may multiply out the 
bracket factors and equate coefficients of corresponding 
powers of x. Thus 

Uq = a 0 , 

a^Za = a 0 (a +/?-[-. ..) = — a lt 

a o ^ a P — a o( a P + • • •) = ^ 2 » • • ( 3 ) 

cioEafiy = a 0 (afiy+...) == — a 3 , 


— ( ) n &«> 

or 2?a = —a x ja Qi Zap — a 2 ja 0 , Zafty = — a z \a Q , etc. These 
relations express the coefficients a, in terms of the leading 
coefficient a 0 and of the elementary symmetric functions of 
the roots (Girard, 1629 ; Harriot, 1631 ; Vieta, 1646). 
The m th elementary symmetric function is the sum Z of 
the combinations of the n letters a, ft, ..., A taken m at a 
time, so that Z has n (m) terms, where 
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the well-known binomial coefficient. We may refer to 
these sums as the sums of ra-ary products of the roots. 

Examples. 

For ax 2 -\-2bx +C, Za — a + p = — 26/a, a p = c/a. 

For a 0 ^ 3 +a 1 a; 2 -}-a2 ir + a 3» Z a — a + jR-hy = — aja 0 , 

ZaP — aj8-f ay+/?y = a 2 /a 0 , ctjSy = — a 8 /a 0 . 


For a quartic (biquadratic) Eafi has six terms and 
2af3y has four. It is important to become thoroughly 
familiar with these relations (3) for the cubic and quartic 
cases. Note that a 0 occurs in each denominator of a 
sum of m-ary products, and that the suffix in the 
numerator determines both the sign and the number 
of factors in each term of the series. This number is 
c alled the weight of the term and of the series and of the 
coefficient. 

In particular cases care is needed, since the notation 
Z is devised to help only in the general case when all the 
roots are distinct. For instance, if f(x) = (x— a) 2 (x— 6), 
thenZa = a+a — b = 2a— b, Za/3 — a 2 — 2a6, a/?y = — a 2 b. 

The relations are of immediate help in the solution of 
an equation whenever the roots are rational, or integral, 
or satisfy simple relations, as when they are in arithmetical 
progression. 

Examples. 1. If all the roots and coefficients are integers 
a 0 must be a factor of each a ( and each root is a factor of a n . 

Solve x z — 62c 2 -f 11* — 6 = 0 by trial. 

2. Solve x*—5x 2 —58x = 88. 

3. Solve x z — 3ax 2 -\- (3a 2 — b 2 )x— a(a 2 — 6 s ) = 0, having given 
that the roots are in arithmetical progression. 

4. Solve 8# 3 — 62a; 2 -f 78#— 27 = 0, given that the roots are 
in geometrical progression. 

5. Solve a 4 +# 8 +7a? 2 -f-4a;-|-12 = 0, given that one of the 
roots is a pure imaginary. 

(Take a, b , ±ic as roots. The roots are ±2 i, — ^(1 db^'v/lTT) 
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29. Allied Equations and Polynomials. It is an 

immediate consequence of the m - ary relations of 28 (3) that 

a 0 x n +a 1 x n ~ 1 y +a 2 x n ~ 2 y 2 + ... +a n _ 1 xy n ~ 1 +a n y n 
= «o {x—ay){x—fiy)...{x—\y) (1) 

This is called the homogeneous form of the identity due 
to factorizing the polynomial f(x ). Other variants of the 
same theme are obtained by putting y — — 1, thus, 

a 0 x n —a\x n - x -{-a 2 x n ~ 2 — . . . + ( — ) n a n 
= a 0 (x-) r a)(x-{-fi)...(x- 5 r A), . . . (2) 

or again by writing 1 for x and x for y : 

a 0 + a & +a 2 x 2 + . . . +a n x n 
= a 0 (l—ax)(l—px)...(l—Ax). . . (3) 

The zeros of this reversed polynomial are a -1 , /? _1 , 

A” 1 , that is, the reciprocals of the roots of f(x) — 0. 

Zero and Infinite Roots of an Equation, The 

equation f(x ) = 0 has a zero root if, and only if, a n = 0, 
as is obvious. When the last r coefficients vanish but 
the coefficient a n _ r does not, then there are manifestly r 
repeated zero roots. In this case the reversed equation 
just written down appears to have r infinite roots, since 
the root a of the original equation corresponds to a - * 1 
in the reversed equation. Similarly, when the first r 
coefficients of f(x) vanish but a r does not, we might 
consider /(#) = 0 to have r infinite roots. This is, however, 
a sophisticated way of viewing the situation, since the 
equation is no longer of the n th degree but of the {n—r) th 
degree. 

It is, however, more fruitful to consider a limiting case, 
and first to ask what is the effect on the roots if the 
coefficients vary slightly ? To answer this, let us consider 
the graph of f(x) when each coefficient is numerically less 
than €, supposed small. As on p. 18, we can take e small 
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enough to ensure that f(x) will be less than any assigned 
small quantity. Let the ordinate of such a curve be 
called 77 , so that 77 is small. Now take any finite values 
of the a { and alter them by adding these small coefficients. 
The resulting graph will differ vertically at (x, y) to an 
extent 77 , which may be positive or negative. Hence it 
must cross the axis of x at points which are near to the 
crossings of f(x) on the axis : that is, the real roots of 
f(x) = 0 are altered slightly when the coefficients are so 
altered. Each real root is in fact a continuous function 
of the coefficients : so too is a complex root, but we shall 
not attempt to prove this. 



Suppose that a 0 ^0 but that a 0 ->0, while a x -^a^ 0, 
a n ->b^0 simultaneously. Let us consider the effect on 
the reversed equation 

a n x n +a n _ 1 z n ~ 1 -\-a x x +a 0 = 0 . 

It will tend to a form for which one root is zero and the 
rest are non-zero. If one root a only of this equation 
tends to zero, then the sum of (?i--l)-ary products of roots 
contains one term which remains finite, while the 

rest of the terms contain the factor a and consequently 
tend to zero. If two roots tend to zero, every term of 
the (n— l)-ary product sum tends to zero. Hence 0, 
which contradicts the supposition. Accordingly if a 0 -*0 
but cq does not tend to zero, then one root and one only 
of the reversed equation tends to zero, and one and one 
only of the original equation tends to infinity. 

Example. Solve tx 2 +2x — 3 = 0. 
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^30. Further Symmetric Functions of the Roots. 

A second set of important symmetric functions of the 
roots of f(x) = 0 may be introduced as follows : 

Definition. — The coefficient h r of x r in the expansion 

a 0 {a 0 +a 1 x+...+a n x n )~ l 
= Kq^K^x-^-Ti^x 2 . . . -\-h r x r • . (1) 

is called the homogeneous product symmetric function of 
order r in the roots of the equation 

f(x) = a^ n -fcqz" -1 + . . . +a n _ 1 x -\-a n = 0. 

We can in fact write the left-hand expression in (1) as 

a 0 {a 0 (l -ax) {l— fix).. .}-* 

or (l+ax+a 2 x 2 +...){l+px+p 2 x 2 +. . (2) 

provided that each of ax , fix, ... is numerically less than 
unity, which is true for all small enough values of x. In 
which case by equating coefficients we have 

h 0 = 1, Aj = £a, h 2 — Za 2 +JCa/2, h 3 = i7a 3 +Za 2 j3+Zaj8y, 

and so on. Obviously h r is the sum of all distinct terms 
composed of exactly r factors a, /3, allowing repetitions. 
(These symmetric functions are sometimes referred to as 
sums of m-ary powers and products, sometimes as aleph 
functions of the roots, a name originally given by Wronski). 
The summations involve more terms, and are more 
complicated than those of the elementary symmetric 
functions, which we shall now denote by %, e 2 , and so on. 
But the e and the h functions are equally important, and 
between them there runs a close parallelism. (See for 
example, Aitken, Determinants and Matrices , pp. 113-121, 
where the elementary symmetric functions are denoted 
by «f) 

The number n H r of terms in the summations h r is 
given by putting a = j 8 = ... = A= 1, since each term 
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then reduces to unity. But this gives, instead of the 
identity (1), 

(1-X)-" = n H 0 + n H 1 x+ n H i x 2 +...+ n H r x r +... . (3) 


By the binomial theorem we find that 

„ g , = c-h— i)„, = • < 4 > 


which therefore gives the number of ways of choosing 
r things from n things when repetitions are allowed. 


^ ''31. Relations between the e and the h Functions. 

From 29 (3) we have at once 

F{x) = (l-ax)(l-px)...(l-\x) 

= 1— e^+CgX 2 — ...+( — ) n e n x n , 

while 

{F(z)}- 1 = {11(1— az)}- 1 = 1 +h 1 x+h 2 x 2 -\-...+h r x r -\-... . 

For convenience we may take e 0 = 1, h 0 = 1. Here we 
notice that the e-series is finite while the ^-series is infinite. 
On multiplying them together we have the identity 

1 =F(x){F(x)}-i = (l-e 1 x+...)(l+h 1 x+...), 

so that the coefficient of each positive power in the 
expansion must vanish. Accordingly we have 

e^ hi = 0, 

e 2 — cA+^2 = 0, . . . . (1) 

^ 3 — ^ 2^1 — ^3 “ 

relations which are called Wronski’s relations (Aitken, 
Determinants arid Matrices , p. 114). Solving from these 
each way in turn, we have 

= h 1} hi = Ci, 

^2 == ^ 2 > ^2 == ^1 ^ 2 > * 
e 3 = ^~2^ a +A 3 , h d = e\— 2^2+63, 


(2) 
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and so on. Hence either set may be expressed rationally 
and integrally in terms of the other ; and the inter- 
changeable r61es of the e’s and the h’a in such formulae 
cannot fail to be noticed. 

Example. Prove that 

a 0 e r = ( — ) r a r , . . . -{-a^ = 0 . 

32. The Sums of Powers Symmetric Functions. 

A third set of symmetric functions of the roots is the 
sums of powers : 

s j = Sol = a -f-jS-f- . . . -{- A, 

S 2 = 27a 2 = a 2 +/S 2 + ...+A 2 , 


8 m = Sa m = a w +j8 m +...+A m . 

It is possible to express each of these sums rationally 
and integrally in terms of the elementary symmetric 
functions e r . The results are embodied in Newton’s 
formulae (1707), which run as follows : 

°o 5 i+ a i == 
a Q s 2 +a l s l +2a 2 = 0, 

( 2 ) 


a 0 s v +a 1 8 v _ 1 +...-\-a v _ 1 8 1 +pa v = 0, 


where of course a v = 0 when p exceeds n. To establish 
these relations we calculate f f (x) in terms both of the 
coefficients and of the roots, and then equate corresponding 
terms in powers of x. Now 


Also 


f'(x) = naQX n - 1 +(n-l)a 1 x n -*+...+a n - 1 . 


m = 


f(x) 

x—a 


M , , /(*) 

x—f3 ' "‘~x— A 


(3) 


as is at once seen by differentiating the product form of 
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f(x) in 25 (p. 60). Again, by ordinary long division we 
have 

~~ “ = (a<F n -]-a l x n 1 -f ••• +<*»)/(# — a) 

= a 0 x n ~ 1 + {a Q a l + a 1 )x n ~ 2 + (a 0 a*+a 1 a + a 2 )x n - s + ...-\-(aQa n ~ 1 + ...) ( 4 ) 

without remainder. The process yields an apparent 
remainder /(a) which of course vanishes. (The quotient 
just written down is indeed an alternative way of writing 

a 0 (x-fi){z-y)...(x- A) 

which would result from dividing the factorized form of 
f(x) by x—a.) Let a series similar to (4) be formed for 
each root in turn. This gives n such series, and their sum 
will therefore be equal to/' (x), so that 

/' (x) = na 0 x n ~ 1 + (afa +7ia 1 )x n -' 2 + (a 0 s 2 +0& +na 2 )x n ~ 3 + . . . . 

For example, when n = 3, the three terms involving 
x n-2 gi ve a oa +a 1 +a 0 j8+fl 1 +«oy+ a i f° r coefficient, which 
is <Vi+3%. Similarly for the other terms. 

On comparing coefficients in the original and in this 
final form of f'(x) we have 

(71—1)% = a 0 s l J t~na v (n— 2)a a =a Q s 2 +%Si+tt%, etc. 

whence the first n of Newton’s formulae at once follow. 

Again, let F(x) = a Q x v +a 1 x v - l +...+a n x lc> - n ) where p 
is a positive integer greater than n. Since x p ~ n is a 
factor, and f(x) is its cofactor, we infer that the roots of 
F(x) = 0 are a, j3, ..., A together with p— n zeros. Hence 
the sums of powers of its roots are the same as for f[x). 

By writing out the p th formula in Newton’s relations for 
F(x) according to the above method we obtain 

But all the coefficients subsequent to a n are zero, so that 
V^+¥m+-+Vp- b = 0, . . (5) 

which is the form of Newton’s relations for p>n. 
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The following alternative form of Newton’s relations is 
easily deduced, since a 0 e r = (— ) r a r : 

s x — = 0, whence s x — Cj, 

s 2 -e 1 s x +2e 2 = 0, s 2 = ef-2e 2 , 


This shows that each of the s-functions may be expressed 
rationally and integrally in terms of the m-ary product 
sums of elementary symmetric functions of the roots. 
These equations were first given by J. Gregory in 1675 
for the case of the septimic equation (n = 7) as far as s 7 . 

The first set of relations in (6) between the s and the e 
functions is linear in both sets of variables, and may 
accordingly be solved for either set in terms of the other. 
When this is done determinantally we find that 


= < 

h> 
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h 
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2e 2 
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2! 
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2e 2 
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S 1 
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ol 

S 3 

S 2 

*i 



*i 

1 
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1 

. 

Q 
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1 
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1 

$ 2 

*1 

2 . 

— 

3e 3 

e 2 

*i 

1 

, e 4 = 

41 

S Z 

$ 2 

h 

3 


4e 4 

e 3 

*2 




*4 

$3 

S 2 



and so on. 

If we apply the above reasoning to the equation 
x n +a 1 £ w_1 +a 2 £ n ~ 2 + ...+a n = 0 . 


( 7 ) 


( 8 ) 


where a 0 = 1, we have at once Za 2 ~ s 2 — a\ — 2a 2 . 
If all the roots are real each a 2 is positive, so that a\'>2a v 
Therefore if af<2a 2 or = 2 a 2 there must be some complex 
roots. This proves the following result : 
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The equation (8) has at least one <pair of conjugate 
complex roots whenever 

(1) af<2 a 2 , (2) a\ = 2a 2 , (3) a x = 0, a 2 >0, 

(4) a 1 = a 2 = 0. 

It by no means follows that when a\>2a 2 all the roots 
are real, as the case x 2 +6# 4-11 — 0 shows. Here 36 >22 
but the roots are complex. 

33. Symmetric Functions in General. Any function 
of n arguments a, /?, ..., A which is unchanged by any 
interchange among the arguments is called symmetric. 
For example, x 2 y+xy 2 , sin(z+y), (a? — y) 2 are symmetric 
functions of x and y, but the function x—y is not, since it 
differs from y—x. We have already considered several 
symmetric functions of the roots of an equation, and 
the fact that the coefficients of 

« n +a 1 x n - 1 +...-fa n = 0 

are instances of such functions suggests their importance. 
Actually they play a fundamental part in many branches 
of higher mathematics, beginning with the theoretical 
solution of a cubic or higher equation. 

Fundamental Theorem on Symmetric Functions. Every 
rational integral symmetric function of the n arguments 
a, A can be expressed as a rational integral function 

of the n elementary symmetric functions e li e 2 , ..., e n . 

Proof. This follows by induction. Every such 
symmetric function S is a sum of terms ca r /K..A*, where 
c is a constant. Select from S a term of lowest degree 
in the n arguments, that is, one for which the sum 
r+s is least. Collect together all the terms which 
are symmetrical with it and necessarily have the same 
coefficient Cj, calling the sum of these c 1 S v Since S is 
symmetrical and S 1 also, then manifestly S—c 1 S 1 is so too. 
Repeat the process on this by collecting out of it a 
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symmetrical set c 2 S 2 ' and continue until the whole 
expression S is exhausted. This expresses S as a finite 
sum of monomial symmetric functions such as S v Thus 

S = c 1 /S f 1 +c 2 /8 r 2 +...+c Jfc AS jfc , where S x = 27a f /2 a ...A*. 

For example, when 

S = rc 3 — 3xy+x 2 y+y* + 2x + 2y+xy 2 , 

Si = a?+y, = xij, S z = x 2 y+xy 2 , S t = z 3 + 2 / 3 . 

Among the terms such as 5* several may have the same 
degree p, as S 3 , $ 4 in the above example. When this 
happens we arrange these terms in ascending lexical order 
of the partition {rs...t} belonging to the positive integer p, 
lower indices taking precedence over higher (see p. 2) 
and r <$<...<£. Corresponding to each partition there 
is one monomial symmetric function, and vice versa. 

If we can prove the theorem for a monomial symmetric 
function of any degree p and any set of indices { rs...t } 
it must then be true for every S. We prove it for 
monomials by induction, assuming it true for all monomials 
of lower degree than p, and for those with an equal p but 
an earlier partition in the lexical order. 

Write = (a/?...A) r (jSy...A) s - f ... = T, which can 

be done since we have assumed, without loss of generality, 
that r 

For example, a 2 /9 5 y 5 8 7 = (a j8yS) 2 (£y8) s 8 2 . 

Let the elementary symmetric functions to which these 
groupings of the n arguments belong be e n , e n _ v etc. 
Then T is a term in the expansion of 

E _ e r p 8-r 
& — Vn-l " 1 

in terms of the arguments a, jS, ..., A. 

For example, <x 2 £ 5 y 5 8 7 is a term in 

e2e| e 0 e 2 = (Zapy$) 2 (Zapy) z (Za) % . 
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But since E is a symmetric function it too may be expressed 
monomially in lexical order 

E — c 1 E 1 -\-c 2 E 2 -\-,..-\-c k Ej ey 

where E x is necessarily the monomial due to the set of 
indices {r«s...£}. (In the example E x consists of terms of 
indices 2, 5, 5, 7 only.) Hence E x — S x and 0. On 
dividing by the non-zero constant c\ we express E x or S x 
rationally and integrally in terms of E and earlier 
monomials E 2i E 3 , E k for which the theorem has been 
assumed true. Since E is itself in the required form this 
proves the theorem by induction. The reduction is unique 
(p. 148). 

34. Further Theory of Symmetric Functions. 

Taking the case of the third order, let 

e t = a+/5+y, e 2 = £y+ya+a/3, e 3 = a£y, 

so that a, /?, y are the roots of the cubic 
x 3 = e x x 2 — e 2 x-j-e 3 . 

Multiply this by x and substitute for x 3 in terms of lower 
powers : then 

x 41 = e 1 x s —e 2 x 2 -j-e 3 x 

== {e\—e 2 )x 2 —(e x e 2 ~e 3 )x+e 1 e v 

Similarly, x 5 is expressible in terms of x 3 , x 2 , x and therefore 
of x 2 , x 1 , x° ; and so on. We infer that, if x = a or /J 
or y, we can always express a power of x as a quadratic 

x k = E x x 2 +E 2 x+E 3 , x = a, y, . (1) 

where E ly E 2 , E 3 are polynomials in e^, e 2i e 3 with integral 
coefficients. 

Now take any three positive integers l, m, n and form 
the determinant 

a 1 y l 

a m B m y m . 

a- % r 


A = | a l /3 m y n | = 
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We can therefore write this as 


A = 


Ei a 2 -\-E 2 a -\-E 3 
-f i<i 2 -\-F 2 a -\-F 3 
Qia 2 +G 2 a+G 3 


Ef+EJ+E, 

F l p 2 +F 2 P+F 3 

GJ'+Gf+Gn 


Eiy*+E 2 y+E 3 
F 1 y 2 -\-F 2 y F 3 
Gi y 2 +Q 3 y+Q 3 


E 2 E 2 E 3 
F i F, F 3 
<?i 0. G» 


a 2 j8 2 y 2 
a y 

1 1 1 


= J(e)i(a/Jy), 


let us say, by the multiplication theorem of determinants 
(Aitken, Determinants and Matrices , p. 80), where the 
nine quantities E , F , G are polynomials in Cj, e 2 , e 3 with 
integer coefficients, and hence A(e) is so too. 

If a, /?, y all differ the cofactor determinant (a 2 ^^ 0 ] 
is the difference-product (a— P)(a — y)(/3— y), and the 
quotient 

|a^8 m y n | -f- la^y 0 ! 


is identically equal to J(e) which is a symmetric function 
of a, j8, y. We call this quotient a bialtemant (Ibid., 
p. 113) of order three. Exactly similar treatment applies 
to any number of roots a, j8, A. A bialtemant is 
characterized by its arguments a, j3 , y, ... and its positive 
integral indices Z, ra, n, .... Thus 

Every bialtemant is expressible as a polynomial in the 
elementary symmetric functions Cj with integer coefficients. 

Next, a function <f>( a, /}, y) which changes sign whenever 
any two of its arguments a, jS, y are interchanged is called 
an alternating function of its arguments ; and similarly 
for any number of arguments. For instance, 

p, y) = —<f>(P, a, y) = <f>(P> y, a) = etc. 

Hence the square of an alternating function is a symmetric 
function ; so also is the product or the quotient of two 
alternating functions. But if )S, y) is symmetric, 
the product alternates ; in fact 

^(a, P, y) <p( a, p, y) = —<f>(P, a, y) <p(P, a, y). 
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The alternant |a 2 /3 1 y°| is an alternating function, and 
so is the more general \a l ft m y n |. Their quotient, which 
is a bialternant, is then symmetric. 

Alternative Proof of the Fundamental Theorem 
for Symmetric Functions. The above considerations 
lead to a notable proof of the theorem that every monomial 
symmetric function of a, ft y, ... is a polynomial, with 
integer coefficients, in e v e 2 , e 3 , ... . 

For let 27a Pfi q y r = ip be such a monomial symmetric) 
function. Multiply it by 

<P == \a*py\ = z±* 2 PY- 
Then ^ = 27±a *+*p+Y+2± a * +t P* 1 Y r +”-9 

where the whole summation extends to six terms due to 
permuting a, ft y among themselves, and again to terms 
due to permuting p , q , r. Hence 

= \a p+ 2 j3 q+1 y r \ + |a fl+2 ft* +1 y r | + ... . 

Divide throughout by <p = | a 2 /? 1 ^ 0 |, and this at once 
expresses ip as a sum of bialternants such as 
| a *+ 2 j8«+y|/| a 2j3y| > 

each of which we have seen to be equal to a polynomial 
in the elementary symmetric functions e and with integer 
coefficients. 

The above method is general and applicable to n 
variables a. If ip — Ua v ^ q y r ...X 8 is a monomial symmetric 
function where p-\-q+r~\-,..-{-s = n and some of the 
indices may be zero, the number of terms in this summation 
is the number of distinct permutations of the n letters 
p, q, ..., e taken all at a time. 

An Alternative Based on Cauchy’s Proof. Let us 

segregate one root a and let = a+/ l5 e 2 = a /i+/ 2 > 
e 3 = a/ 2 +/ 3 , ..., e n = a f n _ v so that the expressions 

fi = P+Y+—9 f 2 = Py+P$+---> •••> fn-i = ft'— A 
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are the elementary symmetric functions of the n — 1 
remaining roots. If ip is a monomial symmetric function 
of the n roots, arrange it as 

where each ip { is a function of the n— 1 roots, in which it 
is necessarily symmetric. Assume the theorem true for 
these n — 1 roots, so that each ip is expressible as a function 
of the But 

f x = e 1 — a, / 2 = e 2 — a/* = e 2 — ae 1 +a 2 , and so on. 

Hence each ip is also a polynomial function of the e,- and 
of a ; let us say finally that 

if; = Eq -\-E 1 a ~\-E 2 a 2 + . . . -\~E 

where eachi? is a polynomial in thee,-, and by (1) p. 77 no 
power a v higher than a n_1 is needed. 

By symmetry exactly the same equation is satisfied 
by jS, y, ..., A, so that 

tfj — E 0 +E 1 fi +E 2 /J 2 + • • • e f c * 

Either all of E 0 —ip , E v E 2 , E n _ x vanish, or else by 

elimination |a 1 /J 1 y 2 ...A n | = 0, which cannot be true since 
this determinant is equal to the difference-product of 
arguments a, /?, ..., A all different. Hence ip = E 0 and 
all the other E { vanish. This expresses ip as a polynomial 
in the e,-, and the rest follows by induction. 

Examples : 

1. Evaluate $ 8 , s 6 , s 7 for x z — 5x -f 1 = 0. 

2. Evaluate Za~ z for 4z 4 — 12a; 3 -f 5a: 2 *— 1=0. 

3. Prove that 


Sq Si s 2 s s 

^4 

S% S$ <?4 

S d $ 4 s 6 s 9 


1111 
a p y 8 
a 2 p 2 y 2 8 2 
a 3 p 3 / S 3 


2 



CHAPTER VI 


TRANSFORMATION AND NUMERICAL SOLUTION 
OF ALGEBRAIC EQUATIONS 

35. Increasing or Decreasing all the Roots of an 
Equation by the Same Amount. Given an equation 

f{x) = a 0 x n +a l x n - y + . . . +a n _ y x +a n = 0 . ( 1 ) 

we shall seek for the equation 

F(y) = b^+b 1 ^- 1 +...+b n _ 1 y+b n = 0 . ( 2 ) 

each of whose roots is h less than the corresponding root 
of f(x) — 0. In other words, we transform f(x) to F(y) 
by taking 

x = y+h. 

In the factorized form we shall then have 

f(x) == a 0 (x— a)(x—p)...(x—\) 

= a 0 (y +h —cl) (y +h —ft ) . . . (y +A — A) 

= F(y)- 

The roots of the equation F(y) == 0 are clearly a— h, j 8—h, 
and so on, and these are of the required form ; and so we 
have found the required polynomial F{y) i only in factorized 
shape. Without yet knowing the roots for x we can 
obtain F(y) directly by writing 

f(x) = a 0 {y+h) n +a 1 (y+h) n - 1 ...+a n , . . (3) 

expanding each term and then rearranging. In practice 
this is done by Horner’s method (16) p. 32. 

81 


F 
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Examples. 1. To reduce each root of 


# 4 — 2# 3 + 3#- 

-4= 0 by 3. 

-2 

0 

3 -4(3 

3 

3 

9 36 

1 

3 

12 j'32 

3 

12 

45 | 

4 

Is 

57 

3 

21 


7 

36 

* = y+3, 


10 


V A + 1 02/ 3 +362/ a +57y-|-32 = 0 . 


2. Increase the roots of — 19a: 2 — 106# — 120 = 0 

by 3. Hence solve the equation : (y 4 — % 3 — y 2 -f-8y = 0). 

3. Decrease the roots of x*-\-2x — 2 — 0 by 0*7. 

(y 8 + 2-lt/ 2 + 3-47y-0-257 = 0). 


36. Removal of the Second or Third Term of an 
Equation. If the relation 35 (3) is expanded, the co- 
efficients of y n and the next two highest powers of y are 
respectively 

a 0 , na^i+a^y \n{n— \)aJi 2J r(n— l)a 1 h-\-a 2 . (1) 

If we choose h = —aJnaQ the second term in the resulting 
equation for y disappears. If instead we choose h to 
make the quadratic expression in n y the coefficient of 
y n ~ 2 , vanish, then the third term of the equation for y 
disappears. Usually the same value of h fails to make 
both terms disappear and we content ourselves with the 
first step only. Since ajna 0 is rational the step is readily 
taken, and the resulting equation is manifestly simpler 
than the original. 
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Examples. Remove the second term from the equations 

# 8 -f Qx 2 — lx — 4 — 0, 2x 3 +x 2 — l — 0, a 4 — 8:r 3 +# 2 — # + 3 = 0. 

I 1 53 

(Here h = — 2, — 2 ; y 3 — l$y+2G = 0, 2y 3 — -y — — = 0, 
o b o4 

y 4 — 23y 2 — 61y — 43 0). 

Increasing or Decreasing all the Roots by a 
Constant Multiple. Let x = hy ; then 

f(x) = a 0 k n y n +a 1 Jc n ~ 1 y n - 1 +...+a n = 0. 

This is of special numerical use when & is a power of 10. 
For example, we can regard the equations 

z 3 +2-Lr 2 +3*47;z-0-257 = 0, 

z 3 +21z 2 +347a;-257 = 0, 

x 3 +0* 21x 2 +0- 0347a; —0* 000257 = 0 

as essentially the same. If the roots of the first are 
a, /?, y those of the second are 10a, 10/?, 10y, and of the 
third are a/10, £/10, y/10. 

37. Horner’s Method of Solving an Equation. 

An equation may always be solved to any desired degree 
of approximation, as far as its real roots are concerned, 
by Horner’s method. This is best explained by an actual 
example : we shall take f(x) = x 3 — 4x 2 +6x— 18084 = 0. 

Here /( 0) is negative, so that the graph of f(x) crosses 
the axis x = 0 below the origin, but /( 100) is positive, and 
the graph crosses the parallel x = 100 above the axis 
y — 0. Since the graph is continuous it must cut y = 0 
at least once between x = 0 and x — 100. By trial we 
find that /( 20) <0, /(30)>0, so that a root lies between 
20 and 30. Now reduce all roots by 20. This gives an 
equation y 3 +56y 2 +1046y = 12464 which must therefore 
have a real root between 0 and 10. Reduce the roots 
by 8. (The choice of 8 will be explained presently.) 
The new equation is z 3 +80z 2 +2134z — 0, where the 
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balance in the final column of the Horner scheme is zero. 
Accordingly z = 0 solves it : and x = 20+8 = 28. 


1 


-4 
20 
I 6 
20 
36 
20 
56 
JB 
64 
_8 
72 
8 

80 


6 -18984 (28 

320 6520 


326 -12464 

720 12464 


1046 

512 

1558 

576 

12134 


0 


0 = z — y~ 8 — x— 28. 


The actual equation for z is z 3 +80z 2 +2134z = 0, which 
shows that z = 0, or else must be complex since 
80 2 < 4(2134). So the roots for x f being 28 greater than 
for z, are 28 itself and two complex values also. 

How, it will be asked, was the figure 8 chosen for the 
reduction of y ? Had 9 been chosen the balance in the 
third column would have been increased by more than 
512 and that of the fourth would have become positive. 
This last fact shows that /(20+9)>0 and therefore 29 
is too large a number. Also 7 would have been too small, 
for, except at the initial step, we always choose the highest 
digit which will reduce the numerical balance in the last 
column as much as possible without changing its original 
sign, from — to + or from + to — . 

Notice that in the working arrangement, as exemplified 
above, the successive reductions can be carried out as 
successive continuations of the first one. 

More often than not the root is a non-terminating 
decimal. In such a case the balance in the final column 
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is never zero. The following worked example is chosen 
for comparison with the preceding. 

Example. Solve x 3 — 4x 2 -j- 6# — 20,000 = 0. 

1 -4 6 -20000 (28-4 

20 320 6520 

16 
20 

36 I 1046 -1016-000 

20 I 512 866-464 

56 1558 -149-536 

8 576 

64 2134-00 

8 32-16 

72 2166-16 

8 32-32 

80 7 0 2198-48 " 

0-4 

80^4 

0-4 

80-8 

0-4 

8T2 

The method is the same as before, but at the third stage 
a reduction of the root by 0*4 is effected (trial shows that 
0*5 is too large : it would make a positive final balance). 
Since the operation is persistently that of adding 0-4 times 
the balance, it is hardly surprising to note that the balances 
in the second and third columns are altering relatively little. 
The change is less at each successive stage : at the next, 
with 0*06, we should have 

81-20 2198-4800 -149-536000(0-06 

0-06 4-8756 132-201336 

81-26 2203-3556 -17-334664 

0-06 4-8792 

81-32 2208-2348 

0-06 
81-38 


326 -13480 

720 12464 
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This leaves a negative balance in the final column : 0*07 
would have given a positive. The next approximation is 
0*007 which adds on less than 0*6 in the third column. The 
fourth decimal figure in the answer would add on even less, 
but the number of significant digits in the balances is becoming 
unwieldy. Let us therefore redress the balances by deleting 
digits from the right — one from the last column but one, 
two from the last but two, three from the last but three, and 
so on. We then have 


17*334664(0*007847745 
15*461628 
-1*873036 
1*767544 
-0*105492 
88380 
-0*017112 
15466 
' 1646 

1547 
99 
8_8 
11 
1JL 

"o 

x = 28*467847745... . 


81*38 2208*2348 

0*569 
2208 *804 
0*569 

8X 2209*373 
0*06 
2209*43 
0*06 
2209-49 


Considering the precision of the result, which gives 
x to ten significant figures, the labour involved is by 
no means unreasonable. We may note a few general 
principles : 

(i) After the first and perhaps the second digit has 
been found the remaining figures are successively given 
with comparative ease. Each new figure is roughly given 
on dividing the balance in the final column by that in 
the next preceding column. (For example, the third 
significant figure 4 in the illustration above is suggested 
by 1016000/213400.) But we must allow for a possible 
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increase in such a divisor ; as in this case 213400 rises 
to 216616. 

(ii) The decimal point may be omitted except in the 
answer. Instead, at each new stage the balances may 
be multiplied by 1, 10, 100, 1000 and so on, from left to 
right. Thus we attach one, two, three ciphers to the 
balance in columns two, three, four and treat them all 
as whole numbers. This is illustrated in the example 
which now follows. 


Example, a; 3 — 0* 4a; 2 +0-06^ — 20 = 0. 


1 


♦800 

4 

804~ 

4 

808 

4 


812 

B 


-0-4 

0-06 

2 

3-2 

Fe 

3- 26 

2 

7-2 

rr 

10-46 

2 

1046 

5-6 

512 

*56“ 

1558 

8 

576 

64~ 

213400 

8 

3216 

72 

216616 

8 

3232 


21984 B 
_ 49 
2203? 
49 


22083 


-20 (2-846785 

6-52 
-13-48 
— 13480* 

12464 

-1016000* 

866464 

-149536 

132204 

-17332 

15456 

^1876 

1766 

'-110 

110 


(* As in (ii) above.) 


(iii) Contraction may alternatively be employed, by 
deleting figures as already explained. It uses up the 
balances and produces a result to about twice as many 
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significant figures as were reached at the stage when 
contraction began. 

(iv) At the 7 of the third decimal place in x on p. 86 
the balances were 1, 81*38, 2208*2348, etc. Contraction 
gave 81, 2208*234, but 569 was entered as the approxima- 
tion better than 81x7—567. It was obtained by paying 
attention to the figures 38 just deleted. Similarly, the 
deleted 8 in the third column contributed to the next 
balance 2208*804. 

Again for comparison almost the same equation is 
dealt with on p. 87, but contraction takes place after three 
stages. 

38. Rational Roots of an Equation. If the co- 
efficients of an algebraic equation are complex we may 
replace the equation by one of higher degree whose 
coefficients are real and whose roots include all the original 
roots (5, Ex. 2, and 27, p. 64). If these real coefficients 
are irrational we may approximate to them by decimals to 
any degree of accuracy, in which case the corresponding 
roots are approximations also of any degree of accuracy 
(p. 69). Multiplication by a suitable power of 10 then 
removes the decimals and yields integer coefficients. 
Hence the limitation of the discussion to rational equations, 
with integer coefficients, is less of a restriction than at 
first sight appears. 

For example, \/2x 2 — (\/3-f|)^+4 = 0 
is approximately equivalent, with fair accuracy, to 

l*414# 2 -2*232#-M = 0, 

and with still more accuracy, to 

1 *4142 1# 2 — 2*23205# + 4 = 0, 

that is, to 

141421# 2 — 223205# + 400000 = 0. 
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Equations with Integer Coefficients. We may 
always reduce a given equation to the form 

x n -\-a l x n ~ 1 +a^x n - 2 -\-...-\-a n = 0. . (1) 

A case of special interest arises when the coefficients a lf 
a 2 , ..., a n are all integers, for the roots of the equation 
are then (6) p. 11, algebraic integers. If all the coefficients 
a 0i a lf ..., a n of an algebraic equation are integers the 
reduction to the still more special form (1), in which 
a 0 = I, can always be carried out ; for we have only to 
take z = a Q x as a new variable to obtain an equation in 
z, namely, 

z n +a 0 a 1 z n - 1 +ala 2 z n - z +...+a”a n = 0, 
which is evidently of the form (1). 

For example, if lx 2 ~Qx — 5 = 0, and z = lx, wo have 
49x a — 42cc — 35 = 0, that is, z 2 — 6z — 35 — 0. 

Theorem. Each rational root of an equation with 
integer coefficients which is in the form (1) above is necessarily 
an integer. 

Proof. Let p/g, rational and in its lowest terms, be a 
root of equation (1). Then 

(W?) n 'Mi(W?)" _1 +•••+«« = o. . . (2) 

Multiply through by g" -1 . Then every term is a whole 
number except the first, which is p n /q, a fraction. This is 
impossible : hence pjq must be a whole number. 

To Find the Rational Roots of a Rational Equation. 

If a Q ^\ we first bring the equation to the standard form (1). 
By the factorized form of a polynomial (25) p. 60, a n is now 
the product of the roots of the equation, and these if 
rational are integers, as we have just seen. Thus the only 
possible rational roots are to be found among the factors 
of a n , with positive or negative sign affixed, and these 
must be examined one by one. 
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Example. 9x 4 + 6o; 3 4-19^ 2 + 6a; — 16 = 0. 

Instead of multiplying by 9 s it is enough here to multiply 
throughout by 9 and to put 3x = z. This gives 
f(z) = z 4 + 2z 3 -f 19<5 a -fl8z- 144 = 0. 

The factors of 144 are il> zb 2, ^3, dz4, zb^> zb zb 9, zb 12, 

etc. Actual roots are 2, —3, so that (z — 2)(z + 3) is a factor 
of f(z). The other factor is z a -fz-f 24, which leads to complex 
2 

roots. Hence x — —1, and two complex values. 

3 

39. Other Methods of Solving Equations. 
Iteration. If the equation f(x) = 0 can be readily 
thrown into the form x = </>(z) it is sometimes possible to 
solve it by iteration. Geometrically the problem consists 
in finding the x coordinate of a point Z common to the 
line OP and the curve LM , whose equations are y = x, 
and y — <f>(x) respectively. 



Take any point A on OP and draw a staircase or spiral 
polygon, with lines alternately parallel to Oy and Ox 
and comers alternately on the curve and on the line OP. 
In both figures the points A, B, C f ... tend to the desired 
limit Z , the point where OP cuts the curve LM. 
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Now the coordinates of B, C , ... are easily found from 
those of A. If A is (a, a) 3 B is ( b , 6), etc., then evidently 
the ordinate y of B is that of H , the point (a, <}>(cl)). Hence 
b = <f>(a) y c = <f>(b) 3 d = (f>(c), .... 



Whenever the sequence a, b,c, d, ... tends to a limit a 
it furnishes a root a of the equation f(x) — 0. 

Example. — 2x — 5 = 0. 

Take this in the form x = ^/(2a;-|-5) and let a = 2. 

Then 6=^/9= 2-08.. and so 26 + 5 = 9-16.. 

Thus c = lj/9-16 = 2-092.. and 2c + 5 = 9-184.. 

d — iJ/9-184 = 2-0942.., and in the same way 
e = 2-0945..,/ = 2-09455.. 

Actually o = 2-09455148... 

The process is available whenever the repeated 
calculation of <j>{ x) can readily be effected. While the 
choice of the initial value a is largely arbitrary, it may 
quite possibly lead to a divergent sequence. For example, 
had A been taken to the left of the point K in Fig. 1 
it would have led to a staircase diverging from K , as 
experiment will at once show. Unless the curve were to 
recross the line PK produced no limit would then be found. 

Also in Fig. 2 if the curve at Z crosses OP at 45° down- 
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wards, that is, if <£'(a) = — 1 at (a, <f>(a)) then difficulties 
may arise. Indeed, if the arc LM is symmetrical with 
regard to OP, cutting OP at right angles, the spiral polygon 
becomes a square on diagonal AB, endlessly repeated : 
and O coincides with A . 

The method of iteration has the agreeable property 
that it is not seriously vitiated by an intermediate mistake 
or two committed in the calculation at any stage, provided 
that the sequence of results points to a clear limiting 
value ! Had c been taken as 2*082 in the example this 
would merely have been equivalent to choosing a value 
of a rather less than 2. The values of d , e, / would still 
have 2*09 as their leading digits. 

This pleasing feature makes the method popular. For 
a more detailed account of it the reader should consult 
Whittaker and Robinson, The Calculus of Observations , 
pp. 78-84. Tho method is due to James Gregory and 
Michael Dary independently in 1674. The same principle 
was also used by Newton (1707). 

40. The Approximate Method of Newton. If the 

curve y = f(x) is known to cut Ox near a point x = a, 
we may proceed as follows : Choose a point A near to Z 
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on the axis of x : draw the ordinate AP to cut the curve 
at P, and the tangent PB to cut Ox at B . Draw the 
ordinate BQ to cut the curve at Q and the tangent QC 
to cut OX at C. Repeat the process. If y — f(x) denotes 
the curve and if both f'(x) and f”(x) do not vanish in the 
range AB in the first figure or AZ in the second, then this 
procedure, repeated again and again, yields points A, 
By Cy ... which approximate to the point Z t as the actual 
construction shows. 



In this way we find a series of successive approximations 
to the root a of the equation f(x) — 0, which lies near a 
given value x — a. 

Analytically, let y—f(a) =f'(a)(x— a) be the equation 
of the tangent at P, the point (a, /(a)). Then OB or b 
is given by putting y — 0, so that 

b = a— f (a) If (a). 

Similarly, c = £—/(&)//'(&), and so on (see also p. 159). 

Example. x*—2x—5 = 0. 

Here/(2) = — 3,/(3) — 16, so that a root lies between 2 and 3. 
Put a = 2. Then b = a — (a 3 — 2a — 6)/(3a 2 — 2) = 2*1. 
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Also 


c = 2-1 


2*1 8 — 4*2 — 5 
3 X 2*1 2 — 2 


= 2*1 - 


9-261-9*2 


= 2*0946. 


d = 2*0946 


13*23-2 

2-0946 3 -4*1892- 


3 X 2-0946 2 — 2 
= 2*0946 —0-000541 55/1 1*1 62 
= 2*094551483, 


where the last figure 3 is approximate. 


As the above example may suggest, Newton’s method 
in favourable cases is powerful and rapidly convergent. 
It is applicable not merely to algebraic equations, but to 
transcendental equations, provided that f'(x) is readily 
calculable : for example, the equation x — tan x can be 
solved with sufficient accuracy, provided that we have 
tables from which see 2 # can be interpolated. The method 
is also iterative, and so enjoys the advantage already 
mentioned, namely, that it is not vitiated by an error at 
an intermediate stage. Newton elaborated a geometrical 
form of this method in the Scholium of Proposition 31 
Book I of the Principia (1687), where he applied it first 
to the equation x—e sin x = N, and next to e sinh x—x = N. 
These equations arose out of Kepler’s Problem, to find the 
position of a planet at a given time in an elliptic or hyper- 
bolic orbit of eccentricity e. 


Examples. 

1. Show that e® — 1 = 2x has one positive real root, and 
approximate to it by Newton’s method. [Take a — 1.] Then 
b = 1*392, c = 1*275, d = 1*256. 

2. Determine the real roots of x 6 — l(Xr 3 +4 = 0 to two 
significant figures. 

Horner’s method leads to the values —3*182034, 0*751234, 
3*141818. Newton’s method, with a = 1, b — *8, c = 0*7538 
roughly approximates to the smaller positive root. 



CHAPTER VII 


THE LOCATION OF THE ROOTS OF AN EQUATION 

41. The Significance of the Sign of a Polynomial. 

Let us consider the graph of the real polynomial 

y = a 0 X n +a 1 X n ~ 1 + ...+a n = f( x ). 

It is a continuous line which crosses each line parallel to 
the axis x = 0 once, and it crosses the axis y — 0, or 
touches it, at those values of x which give the roots of the 
equation f(x) = 0. If /(a)< 0 and /(&)> 0 we say that y, 
or f(x), changes sign as x passes from a to b. Now y is 
negative at x = a, changes continuously in value as 
x changes, and becomes positive when x = b. Unless y 
vanishes for some value of x between a and b this could 
not happen. Hence there is a root between x — a and 
x — b. Similar reasoning applies if/(a)>0 and/(6)<0. 

Now consider the graph itself with its possible ups and 
downs. Let A be the point on the graph where x = a, 
and B be the point where x = 6. 



If f(a)< 0, /(&)> 0 there may be one real root between 
A and B (as when B is at B x in the figure) : or three real 
roots (as when B is at B 3 in the figure), but certainly not 
two real roots, or four real roots. 

06 



96 


LOCATION OF ROOTS 


If the graph touches the axis y = 0 we can regard 
the point of contact as the point of coincidence of two 
roots of the equation. 



If the graph touches the axis y = 0 at an inflexion, 
we can regard the point of inflexion as the point of 



coincidence of three roots of the equation. And so on. 
The characteristic property is therefore as follows : 

Theorem. An odd number or an even number of real 
roots of an equation f(x) = 0 lie between two values x = a 
and x == b, according as f(a), f(b) differ in sign or have 
the same sign. 

Corollary. An equation of odd degree must have at least 
one real root ; and equation of even degree need not necessarily 
have a real root. 

For if x is large f(x) behaves like a 0 x n (cf. p. 21), 
where without loss of generality we may take a 0 — 1. 
If x<0 so is x n when n is odd. We write /( — oo)<0. 
And if ce>0 so is x n , and so /(oo)>0. Hence f(x) changes 
sign at least once, and there is a real root. But if n is 
even/( — oo) and/( + oo) are both positive and the function 
f(x) may indeed never become negative. 

Examples, x 2 -j-4, £ 4 -f- 2x 2 -f 4 are functions that cannot be 
negative for any real values of x. 
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42. Upper Bound to the Real Roots, and Newton’s 
Rule. Let us take the equation with a 0 = 1, so that for 
large enough values of x the function f(x) is positive. If 
b is such that f(x)> 0 whenever x>b we call b an upper 
bound (it used to be called an upper limit) to the real roots 
of the equation f(x) = 0. Manifestly it is a help, in the 
search for the roots, to ascertain that they are confined 
below such a definite value 6. Obviously no root can 
exceed 6, a value of x from which onwards f(x) is always 
positive. There may also be numbers less than b which 
may yet exceed each root of f(x) = 0 ; the bounds may 
be drawn more tightly. 

Newton gave a rule to the following effect : if f(b), 
f'(b), f^b), ..., f (n) (b) are all positive , b is such an upper 
bound to the real roots of the equation f(x) = 0. 

The rule follows at once from Taylor’s formula 

f(x) =f(b+x-b) — /(&)+(#—&)/'(&) + --.+^—-—/ (n) (^)> 

n I 

since the right-hand expression is positive when f(b) and 
its n successive derivatives are positive, and x>b ; so that 
f(x)> 0 whenever x >b. 

Such a bound b is of course not unique, for any point 
to the right of b on the axis of x would answer equally 
well. 

Examples. 

1. f(x) = x 3 — 9:c 2 -f-4a;-t-90 == 0, 
f'(x) = 3x 2 — 1 8a; + 4, f"(x) = 6*-18. 

Here f"(x)> 0 if #>3 and f'(x)< 0 at x = 3, 4, 5, but 
/ / (6)>0, /(6)>0 ; hence no real root exceeds 6, which is an 
upper bound. 

2. For z 3 — 12* a -f 57o;-94 = 0, 6 = 4. 

Lower Bound to the Real Roots. The equation 
x n f(x~ x ) = 0, which reverses the given equation, has for 

G 
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its roots the reciprocals a” 1 , /J” 1 , A -1 of the roots a, 

A of f(x) = 0. For 

x n j{x~ 1 ) = x n (a Q x- n +a 1 x- n + 1 +...+a n ) 

= a 0 + a i*+---+ a nS n 

= ao(i-aa:)(l-^)...(l— Ax), . . (1) 

and the roots are a^ 1 , jS” 1 , ..., A -1 . Now if no root lies 
between b and +oo, when b is positive, then no reciprocal 
lies between b* 1 and zero. Accordingly, if 6 is a positive 
upper bound of the reciprocal equation, b~ l is a lower 
bound for the positive roots of the original equation. 

Example 3. The reciprocal equation to f(pc) =0 in 
Ex. 1 is 90o: 3 -4-4^ 2 — 9^-f 1 = 0, 

and the derivatives are 

270a; 2 -f 8#— 9 and 540a; + 8. 

Now x = 1/5 renders these three polynomials positive, and 
so no root lies between 1/5 and -f- 00 * so that no root of the 
original equation lies between 5 and 0. We infer from 
Examples 1 and 3 that real positive roots of 

x z — 9a; 2 -f 4a; -f 90 = 0 

can only lie between x — 5 and x = 6. 

A lower bound to all the real roots can be found by 
considering the upper bound of /( —x) = 0, the roots of 
which are —a, —/?, ..., —A ; and, finally, an upper bound 
to the negative roots is given by considering x n f( — x~ x ) = 0, 
the roots of which are —a -1 , --jS" 1 , —A -1 . 

Example 4. If <f>(x) = x 3 -\- 9a; 2 + 4a? — 90 = 0, 
then <f>(—x) = 0 is the equation of Ex. 1. The derivatives 
are positive for x>0, but <£(0)<0, <£(2)<0, <£(3)>0. Hence 
x = 3 is an upper bound ; and no root of the original equation 
of Ex. 1 is less than —3. 

Again, when /(o)< 0, /(6)>0, and f'(x)>0 for a<x<b f 
then f(x) = 0 has exactly one root between a and b ; for 
the graph must be climbing all the way between x — a and 
x = 6. In Ex. 4 there is exactly one root between 0 and 3. 
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43. Relative Location of Roots of an Equation and 
of the Derived Equation. The following important and 
useful theorem, first published by Rolle in 1689, is true 
for functions more general than polynomials, but is applied 
here to algebraic equations. 

Rollers Theorem. Between any two consecutive roots of 
the algebraic equation f(x) = 0 there lie an odd number of 
roots of the equation f'(x) = 0. 

Since the graph y = f(x) is continuous, f(x) must have 
a maximum or minimum at least once between any two 



values x — a and x = f3 for which f(x) vanishes. In fact 
f(x) cannot always increase as x changes from a to 
if f[ a ) = f{j 3) : nor can it always decrease. Again, at such 
a maximum or minimum the gradient of the tangent is 
continuous, so that f'(x) exists but is neither >0 nor <0. 
It can only be zero. 

Once more, if an even number of zeros of f'(x ) lie between 
two points P and Q on the curve, then the gradients of 
the tangent at P and Q are both positive or both negative. 
Therefore if the curve is ascending through P it is ascending 
through Q : and this is impossible when P and Q lie at 
consecutive roots of the equation f(x ) = 0. Similarly for 
descending values. Hence f'(x) vanishes an odd number 
of times between two consecutive roots of f(x) — 0. 

Corollary . The roots of f "(x) = 0 separate those of 
f '(x) = 0 in a similar manner. 

* For an illustration see p. 19, Ex. 1. 

44. The Harriot-Descartes Rule of Signs. We now 
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proceed to investigate a remarkable theorem, implicit in 
the work of Harriot but first used explicitly by Descartes 
(1637), which limits the number of positive or negative 
roots of an equation ; but first we require some preliminary 
definitions. 

When a polynomial a Q x n +a 1 x m +a 1 x*+...-\-a r x (l is 
arranged in descending order of index and has non-zero 
real coefficients, a continuation is said to occur whenever 
the signs of two consecutive terms are the same, and a 
change is said to occur whenever these two signs are 
contrary. Thus in the polynomial 

a* —3a; 7 +4a; 5 -a* -2a; 3 -3a; 2 +2a; +3 

three continuations occur, following the terms —a; 4 , 
—2a; 3 , +2a; respectively, and four changes of sign occur, 
following the terms x s , —3a; 7 , +<£a; 6 , —3a; 2 respectively. 

When ft, m, p, ..., q are consecutive positive integers 
the polynomial f(x) is said to be complete. All these remarks 
apply also to the associated equation f{x) = 0. The theorem 
in question exhibits a connexion between these changes 
of sign and the number of positive roots of the equation. 
We begin with the following lemma : 

Lemma. If a real polynomial , complete or incomplete y 
is multiplied by x— a, where a>0, the product will contain 
at least one more change of sign than the original . 

(i) Proof for the complete polynomial. Let the 
multiplication be performed in the ordinary way, but let 
the signs only of the terms be written down ; then we shall 
have a scheme such as 

+ + + + + + - + 

+ - 


+ + + + + + - + 

+ + - + + + - 


+ ± ± - H T + ± 1 
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The top row represents the complete polynomial written 
in descending order, wherein any arbitrary sequence of 
signs may occur. The second row represents a;— a, where 

the signs are fixed as H . A double sign is placed 

where the sign of any term in the product is ambiguous. 
The following laws will be seen by inspection to hold : — 

(1) To every group of consecutive continuations in the 
original there corresponds a group of the same number of 
ambiguities in the product polynomial. 

(2) In the product polynomial the signs before and 
after an ambiguity or group of consecutive ambiguities 
are contrary. 

(3) In the product polynomial an extra term appears 
at the end, and with it a change of sign is introduced. 

Now in the product polynomial take the most un- 
favourable case and suppose that all the ambiguities are 
replaced by continuations ; then, by the second law, 
without affecting the number of continuations, the upper 
signs may be adopted for the ambiguities ; and thus the 
signs of the original polynomial will be repeated in the 
product polynomial, except that by the third law there 
is an additional change of sign introduced at the end of 
the new polynomial. Thus in the most unfavourable case 
one more change of sign occurs in the product polynomial. 

(ii) Proof for the incomplete polynomial. Introduce 
zeros, together with the signs + and — , in the 

multiplication scheme, in order to indicate exactly the 
missing terms : for example, 

+ +0 bOOH 0 1 0 + 

— — 0 -f- — 00 — -j- 0 -f- — -f- 0 — 

Between each zero, or group of zeros, is a complete subset 
of terms to which the preceding case applies, so that 
each subset gains at least one change of sign. But there 
may be a loss of a change at the actual gap, as in the 
first zero gap of the illustration above, where -}- 0 — 
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becomes . Now the number of complete subsets 

separated by zero gaps is one more than the number of 
these gaps : hence the certain gains outweigh the possible 
losses by at least one. The result is thus established. 

Descartes' Theorem . In any equation , complete or 
incomplete , the number of positive roots cannot exceed the 
number of changes of signs of the coefficients , and in any 
complete equation the number of negative roots cannot exceed 
the number of continuations in the signs of the coefficients . 

Proof. Resolve f(x) into ifi(x)(x—a)(x—b) ... (x—c) y 
where i/j(x) contains all the factors due to negative and 
to pairs of complex roots, while all the factors x— a, and 
so on, due to positive roots are explicitly given. Now 
multiply i jj(x) by x—a , x—b y ... in turn. At each 
multiplication at least one change of sign, by the lemma, 
is introduced into the product, so that the first part of the 
theorem follows immediately. 

To prove the second part we suppose f(x) to be complete 
and put — y for x ; then the original continuations of sign 
become changes of sign. Also the transformed equation 
cannot have more positive roots than it has changes ; 
and thus there cannot be more negative roots in the 
original complete equation than the number of its 
continuations of sign. 

Corollary. An equation , complete or incomplete , cannot 
have more negative roots than f(— x) has changes of sign. 

For the negative roots of f(x) = 0 are the same as the 
positive roots of f(—x) = 0. 

Examples. 

1 . x z -f 2# — 2 = 0 has one change of sign and may there- 
fore have one positive root. Actually it has a root between 
0 and 1 since /(l)>0>/(0). Now 

( —a:) 3 + 2( —x) —2 = 0 

has no change of sign, and therefore has no positive rootp; 
and so the original equation has no negative root. It has 
two complex roots. 
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2. x* — l = has one real root, namely x = 1. 

3. x 10 — 1 = has two real roots, namely x = 1, x = 1. 

4. x a — 2x 2 —5 — 0 has one real root, positive. 

5. aj 4 +a; 2 + l = 0 has no real root. 

45. A Precise Rational Test for Real Roots. 

Remarkable as the Harriot-Descartes Rule of Signs is, 
it still leaves an uncertainty as to the exact number of 
real roots in an equation : it only gives an upper limit 
to them. The problem of finding an exact test engaged 
the attention of mathematicians for the next two hundred 
years, until it was finally solved in 1829 by Sturm, who 
published his result in the Memoir es Divers des Savants 
Etrangers , Paris, 1835. Sturm showed how to find for 
any equation, by rational methods, the exact number of 
real roots which lie within any given range of values. 

Sturm's Theorem. There exists a set of real polynomials 
f(x), f'(x), f 2 (x), ..., f m (x) whose degrees are in descending 
order , such that , if b>a, the number of distinct real roots 
of f(x) = 0 between x = a and x = b is equal to the excess 
of the number of changes of sign in the sequence f, f', f 2 , ..., 
f m when x = a over the number of changes of sign when 
x = b. 

Proof . First suppose that f(x) has no repeated factors. 
Take f(x) = x n d-apr 7 *- 1 + . . . +a n , 

f'(x) its derivative, and f 2 (x) the remainder, with its sign 
changed, of the division f(x)/f f (x). Continue the G.C.M. 
process until a constant is necessarily reached, since / 
and /' have no common factor (p. 61) ; but at each of 
the m— 1 remainders change its sign before continuing. 
These modified remainders are the required functions f { ; 
and we have the following relations : 

/ = ft/' -f» 

f' = £ 2/2 “ fz> 


fm- 2 Qm-ifm-i f my 
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where each / or q indicates a polynomial in x or, in the 
final case, a constant f m . 

From these relations we can draw certain inferences : 

(1) 0, else all the preceding /’ s have a common 
factor involving x. Hence the constant f m has a permanent 
sign + or — , whatever value attaches to x. 

(2) No two consecutive /’s can vanish simultaneously ; 
else they and all their predecessors would have a common 
factor. 

(3) If any f { = 0 then, at this value of x, f {+1 and 
have contrary signs, since they must be equal and 

opposite in value. 

Now consider the sequence of signs in the set of /* s 
when x = a, which is taken at a place where none of the 
/’s vanish. As x increases from x — a no change of sign 
within this sequence can occur until x reaches a zero of 
one of the /’ s. We therefore examine what happens at 
such a zero. 

If f. = 0 at x = c, take a range {c—h, c+h) covering 
the value c, and such that no other / vanishes within the 
range. This is possible since the zeros of all the /’ s are 
definitely distinct. The signs of f { _ x at x — c—h , c, c-\-h 
are the same, let us say + + + : those of f i+l are therefore 

, by (3) above : while those of /* are 0, zb* 

Accordingly the signs of f { _ v f i+1 are + zb — just before, 
+ 0 — at, and + ± ~ J us ^ after x = c. In each case this 
group therefore presents one permanence and one change 
of sign as x passes through a zero of ; and the same 
happens if is negative at x = c. 

This argument is equally valid for each f { including 
but not for the initial member / of the sequence. As x 
increases and passes through an unrepeated root a of the 
equation / = 0, the signs of / must be either + 0 — or 

— 0 +, while the corresponding signs of /' must be 

or + + +» according as / is decreasing or increasing 
through the root. In either case the signs of /, /' differ 
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before, and are the same after, the passage of x through a ; 
hence one change of sign is lost. 

Thus we have proved that as x increases, Sturm’s 
functions never lose a change of sign except when x passes 
through a root of the equation / == 0, and never gain a 
change of sign. Hence the number of changes of sign 
lost as x increases from a value x — a to a greater value 
x = b is equal to the number of the roots of the equation 
f(x) = 0 which lie between a and 6. The theorem is thus 
established for the case of unrepeated factors. 

Consider next the case where f(x) has repeated factors. 
Then /, /' possess a G.C.M. (x—a) p (x—f}) q ...—G, 
which is common to all the Sturm functions / /', / 2 , ..., 
Let ua write / = Ocf>, /' = Gfa, / 2 = G<f> 2 , f m = G<f> m , 
where necessarily 

0 — (x—a)(x—f}) ... (x— A) 

and no two functions 0, 0 1? 0 2 ..., 0 m have a common 
factor. By the previous case the number of real zeros 
of 0 can be ascertained from the Sturm sequence of <f>’ s. 
At any value x = c for which G^ 0, the signs of the /’s 
will be either identical with those of the 0’ s (for G> 0) 
or else will be entirely contrary (for G< 0). In either case 
the / sequence has the same number of changes of sign 
as the 0 sequence. At a value x — c for which G = 0, 
no signs appear at all, for all the / terms are zero. Hence 
the examination of the sequence of signs of the / series 
between x = a and x—b discovers the number of real 
roots of the equation 0 = 0, that is, the number of real 
distinct roots of / = 0, since every zero of / is included 
in 0. This completes the proof of the theorem. 

Corollary 1. If f r is a remainder involving x and such 
that f r remains 'positive , or remains negative , for all real 
values of x between a and b, then the sequence need not be 
prolonged beyond f r . 

For in the preceding demonstration the necessary 
property of the last f was that it should never vanish, 
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and as f r cannot vanish the argument holds for the series 

f> f > fz> /r* 

Corollary 2. Any f can be multiplied by a positive 
constant , or a factor involving x, provided that the factor 
remains positive throughout the range in question , and the 
modified function can be used for computing all further 
terms of the sequence. 

For this modification leaves the essentials of the above 
proof unaltered. 

Examples. 1. If f(x) = x B ~ 3z 2 — 4x4-13 = 0, 
f'(x) = 3x 2 -Qx -4, 

; f 2 ( x ) = 2^ — 5, 

/.(*> = h 

The computation can be arranged as follows : 


1 —3 -4 +13 | 

1 

CO 

1 

CO 

3 -9 -12 +39 

6 -12 -8 

1 

CO 

1 

CO 

6 -15 

-3 —8 +39 

+3 -8 

-3 +6 +4 

+ 3 -7J 

— 7 ) — 14 +35 


2 -5 



This is the ordinary G.C.M. process ; only after each remainder 
f 2 and / 3 the signs have been changed. To avoid fractions, 
f has been multiplied by 3 before the division. Division by 
—1 changes the sign of the first remainder and gives a suitable 
/a» by Corollary 2. Next a scheme of signs can be devised : 



/(*) 

/'(*) 

/»(*) 

fz( x ) 

Changes 

— 00 

— 

+ 

— 

+ 

3 

-10 

— 

+ 

— 

+ 

3 

0 

+ 

— 

— 

+ 

2 

1 

+ 

— 

— 

+ 

2 

2 

+ 

— 

— 

+ 

2 

3 

+ 

+ 

+ 

+ 

0 
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In the first column we can write arbitrary values of * ; 
in the last column we write down the number of changes of 
sign in the sequence. Here there is one negative root between 
— 10 and zero, and two positive ones, between x == 2 and 
x — 3. Between which integers is the negative root ? 

2. If /(*) = 2* 4 - 1 3* 2 + 10*- 49, 

fyx) = 2(4* 3 -13*4-5), 

/ 2 (*) = 13* a -15*4-98. 

Here /, has no real factors and is always positive for real 

values of *. The signs at * = — oo are -{ {- and at * = 4- °o 

are 4- 4- 4- ; so that there are two real roots, as shown by 

two losses of change in sign. At * = 0, the signs are f- -f, 

implying one negative and one positive root. 

3. Locate the roots of cc 3 -{-x 2 — 4*4-1 = 0. 

4. Locate the roots of * 4 — 5* 2 4-8* = 10 and evaluate 
a root, l<a<2, to three decimal places. 

5. Find the real root of 48* 3 = 3* 2 4~3*4-L 

6. Also of * 3 4-5* 2 -f *=27. 

7. The equation * 3 — 3*-f 1 = 0 has three real roots, a, /?, y. 
Prove also that the roots of a 3 — 3* 2 + 3 = 0 are j3-f y+l/a 

and two similar expressions. 
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BINOMIAL AND RECIPROCAL EQUATIONS 

46. The Binomial Equation. If each coefficient of 
ffa) — a 0 x n -\-a 1 x n - 1 +...+a n — 0 . . (1) 

vanishes except two, the equation is binomial in form, 
and reduces, apart from zero roots, to 

x m — a = 0, . . (2) 

which is called the binomial equation. By means of 
Demoivre’s theorem this can always be solved. 

There are two cases, the arithmetical and the complex. 

The Arithmetical Case. In this case we assume that 
a>0. The equation x n — a = 0, accordingly, has not 
more than one positive root, by Descartes’ Rule of Signs : 
and since /( 0)<0, f(o o)>0 it has exactly one root, which 
we denote by n Ja, and call the arithmetical nth root of a. 
Here a may be a perfect nth power of an integer or rational 
fraction, in which case x is rational, or a may not be a 
perfect nth power of a rational number, in which case x 
is irrational. 

Example. If x 5 — 32 = 0, x = 2. If a^— 36 = 0, x is 
^36, an irrational number. 

The root may be computed as accurately as may be 
desired in various ways — from a graph for preliminary 
location, by logarithms, by the binomial theorem, or by 
Horner’s method. 

The binomial theorem is most effective when a is nearly 
a perfect nth power. Here, for instance, 

108 
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j^/36 = (32+4) 1 / 5 = 2(l+l/8) 1 /s 
’ = 2(1+1/40-1/800+...) 

= 2-0475... 

from three terms of the series. The use of a seven-figure 
table of logarithms hardly improves on this. 

Example. Find the real root of x* — 60 = 0 to seven 
places of decimals by Homer’s method, comparing the result 
with that given by seven-figure logarithms. 

The Complex Case. Let a be written in the complex 
polar form 

a == r(cos 9+i sin 0), 

where r> 0 : and let b n = r, where b = n Jr, the arithmetical 
nth root of the modulus r of a . If we now put x — by the 
equation x n = a reduces to 

y n = cos 9+i sin 0, 

which is solved by taking 

04-2577 . . 04-2577 

y = cos \-i sin , s = 0, 1, 2, ..., n— 1. 

n n 

Proof. If y has this value, then by Demoivre’s theorem 

y n = cos(0 4-257 t) +i sin(04-257r) 

= cos 9-\-i sin 0 

for each value of s. Hence the stated value of y is a root 
of the equation. 

Also for the n given values of 5, y takes exactly n 
different values, which are represented by points on a 
circle of unit radius whose amplitudes differ successively 
by 277 /ft ; and such points lie at the n vertices of a regular 
pplygon inscribed in the circle. Hence each possible root 
of this equation of the ftth degree has been identified, 
for there cannot be more than n roots. 
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Furthermore, if s takes higher values n , w+1, ..., or 
else negative values, the same vertices are repeated. For 



Geometrically the solutions of the equation 
n = e'O = cos 6-\-i sin 0 


s 1 
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where 9 is, of course, real, are given by the following 
construction : Draw a circle with centre 0 and of unit 
radius OA. Take P a point on it such that the angle 
AOP is 0. Mark the point N x on the circle such that the 



angle AO^ is 9jn : then mark the n vertices of a regular 
inscribed polygon NjN 2 ...N n . The complex numbers 
answering to these n vertices are the n distinct nth roots 
0 f The figures exhibit the cases n = 5 and 6. 

General Solution of x n = a. By combining the 
above results we obtain the n values 


/ 0 + 2«S7T . . 0 -\- 2 s 7 t \ ^ ^ 

x =\a\ 1 l n I cos H sin 1,5 = 0, 1, 2, n— 1, 

\ n n ] 

where \a\ denotes the modulus of a, and 6 is its amplitude. 


Examples. 

1 . * 3 + 8 = 0 . 


Here a — — 8 = 8 (cos 7 r+i sin 7 t), 


and so 


( 7T 7r\ 

cos — b i sin—* | , 2(cos it + i sin 7 t), 
3 3/ 


s= 1 ~b i\/3, — 2, 1 — iVlb 


0 / 5tt . 5t t\ 
21 cos — + ism y I 
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The regular polygon in this case is an equilateral triangle 
with one vertex at the point ( — 2, 0). 


2. x 1 = 1. 


Here a — 1, <j> = 0, x — cos 


2 8tt 
~1~ 


. . 28tt 
+i sin 


3. x* = i . Here a 


. 7T . . 7 r 

i = cos --B Bin 


(4s + 1)tt 
a? cos 10 


-fi sin 


(4^ + 1)7t 
10 * 


4. The real roots of x n — 1 =0 are i 1, if n is even, and 
-f-1 only, if n is odd. On the other hand, £ n + l =0 has a 
single real root — 1 if n is odd, and no real root if n is even. 

5. Group the complex roots of x n — 1 =0 into conjugate 
pairs. Hence resolve x n — 1 into real factors. 

2S7T . 2S7T 

(Take x — cos d:* sin as such a pair. Then the 

n n 

quadratic polynomial 



Vsin* 2 -l* 


vanishes for these two values of x ; 


that is, x 2 — 2x cos ^^-fl 
n 


is a factor of x n — 1 for integral values of s. 

If n is odd there are \(n — 1) such quadratic factors, where 
8 — 1, 2, 3, ...» \(n — 1). If n is even, there are £(n — 2) such 
factors with 5 = 1, 2, 3, ..., \(n — 2). This accounts for all 
the complex factors, but there are also real linear factors, 
namely (x — l)(cc-f 1) when n is even, and x — l when n is odd.) 


6. £ 5 — l = (x — l)^x 2 —2x cos ~ -flj^r 2 — 2x cos lj- 

7. #• — 1 = (x 2 — l)[x 2 —2x cos \^{x 2 —2x cos ^-{-lj. 


47. Euclidean Construction of the Regular Polygon. 

The solution of the binomial equation x n — 1 == 0 and the 
construction of a regular polygon of n vertices are evidently 
closely connected, and already we have considered a general 
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trigonometrical form of the solution. But there are 
other more purely geometrical or algebraical methods at 
our disposal, and it is natural to find out how far, for 
instance, the time-honoured ruler and compass construc- 
tions of Euclid would carry us in forming a regular polygon 
of n sides. Euclid gave us the triangle, square and 
pentagon (n — 3, 4, 5) and any figure deducible from 
these by repeated bisection or by combination of cases. 
In fact whenever n = 2 m .3.5 a regular figure could be 
drawn by ruler and compass. 

For example, to construct a regular 15-gon, take the arc 
of the pentagon, which is a fifth of the circumference, from 
the arc of the triangle, which is a third, and bisect the resulting 
arc. 

But how shall we deal with such cases as n — 7, 9, 
or 11 ? Great interest has been taken in such problems 
throughout the ages, and notable results were found by 
Lagrange and Legendre, who examined the corresponding 
algebraic equations ; but it was Gauss who made the 
greatest discovery in this field since the time of Euclid, 
for in 1796 Gauss proved that the 17-sided regular polygon 
could be drawn with ruler and compass. In fact he proved 
that the case when 

n = 2 2w 4-l 


is always possible provided that n is prime. The lowest 
values of n satisfying this condition are n = 3, 5, 17, 
257, 65537. 

Algebraically, the use of linear and quadratic equations 
is equivalent to such ruler and compass constructions ; 
and it can be shown that the solution of the equation 
x n — 1 = 0 is reducible to that of quadratic equations by 
what are otherwise rational processes, in precisely those 
cases in which, geometrically, the ruler and compass 
succeed. We cannot attempt any proof here, but one 
illustration can be given. 

H 
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Example. Take n = 5, x b — l — 0. Segregate the factor 
x — 1, leaving 

x*-\-x 2 -\-x 2 -\-x-\-\ = 0. 

This is a simple instance of a reciprocal equation (48). To 
solve it we put y — x + l/x, so that y 2 = x 2 +2-\-l/x 2 . Hence 

x 2 +x+l 1/z 2 = y 2 +y~ l == o. 

We can solve this equation for y , and we can also solve the 
quadratic equation corresponding to the substitution, 

x 2 — xy-j-1 = 0, 

as a quadratic for x in terms of y ; in fact we have 

x = iy±V i2/ a — l, where y = — JrLjV 5. 

The quartic equation in x has thus been solved by the 
successive use of two quadratic equations. 

48. Reduction of Degree of Certain Equations. 
Reciprocal Equations. In the first place there are 
certain rather special types of equation which are reducible 
to equations of lower degree and root extractions. It is 
easy, for example, to reduce equations such as 

x 2n -\-ax n -\-b = 0, or x Zn -\-ax 2n -\-bx n +c = 0 
by the substitution y = x n . 

Examples. 

x 9 -\-9x* + $~0, {x 2 +3x+2) 2 -4{x 2 + 3x+2) + 3 = 0. 

Such instances are easy to devise, but they seldom 
occur in practice. A more important and general case is 
that of the reciprocal equation : 

a 0 x n +a 1 a: n “ 1 +a 2 a: n - 2 +...+a 2 a; 2 +a 1 x+a 0 = 0, 

where a 8 — a n _ 8 for 5 = 0, 1, 2, ..., n. 

Theorem. The reciprocal equation of even degree is 
rationally reducible to an equation of half that degree. 

Proof. Put z — x-\-l/x, so that z 2 = x 2 +2 J tHx 2 i 
z 8 = x z -\-Zx+Zlx J r\jx z , and so on. Then 

x+l/x = z, x 2 +l/x 2 = z 2 — 2, x B -{-ljx z = z 3 — 3z, 
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and so on. By this iteration we express 

x*+l/x s as z 8 — sz 3_2 +..., 

that is, as a polynomial in z of degree s , for each value 
of s in succession. Now put n = 2m and divide the given 
equation throughout by x m , so that it can be rearranged as 

a 0 (x m +llx m )+a 1 (x m - 1 -\-llx m ~ 1 )-\-...+a m = 0. 

Substituting for each bracketed expression its equivalent 
as a polynomial in z we obtain an equation in z of degree m. 

Corollary. A reciprocal equation of odd degree 2m +1 
can be rationally reduced to one of degree m. 

Proof. In such a reciprocal equation f(x) = 0 we find 
at once that /( — 1) vanishes : hence £+1 is a factor of 
f(x). If f(x) is written [x+l)ip(x) then i/j(x) is of degree 
2m and is also reciprocal, since the roots of i/j(x) = 0 
are those of f(x) — 0, with the root — 1 removed ; and the 
roots of f(x) = 0 are reciprocal (42, p. 98) in pairs. 

The preceding theorem now applies, and reduces 
f(x) = 0 to an equation of degree m. 

Analytically, f(x) = x^x* 1 ) is the relation characteristic 
of a reciprocal equation f(x) = 0 of degree n. 

Certain equations are reciprocal in respect of the 
numerical values of the coefficients, but alternate in the 
sign of the coefficients, in such wise that the terms may 
be grouped as 

X — l/Xy x 2 +l jx 2 , x z — 1/x 3 , ... . 

When this happens the transformation z = x— l/x will be 
effective. 

Other equations can sometimes be transformed to the 
reciprocal form. For example in 

16a^+8aa; 3 +4&:c 2 +2a;r+l = 0 

vte should first put 2x — y } then y+l/y = z. The equation 
then reduces to z 2 -\-az-\-b— 2 = 0. 
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Examples. 

1. Solve 6a; 4 + 5a; 8 — 3 8a; 2 -f 5a; -j- 6 = 0. 

(Put x + l/x = z. Then 6(z 2 — 2) +52 — 38 = 0, so that z = 2\ 
or — 3J. Hence aj-f-l/a? = 2J or — 3£, and x — 2, —3, — £.) 

2. Solve 2a; 5 -f 7x*+9x 3 +9x 2 +7x+2 = 0. 

(Divide the polynomial by a;-}-l, getting 

2a; 4 -j- 5a; 3 -f 4a; 2 + 5a: -f 2 = 0. 

The solutions are a; = —1, —2, — ±t.) 

3. Solve 2a; 4 -f 3a; 3 — 4a: 2 — 3a: + 2 = 0. 

(Put x — l/x — z. Solutions are a? = 1, — 1, —2, J.) 

4. Solve 6a; 4 + 3 5a; 3 -f- 6 2a; 2 4* 3 5a; -J- 6 = 0. 

6. Prove that the points of the Gauss plane representing 
the five roots of (z-f l) 6 = 32z 5 are coney clic. 

6. Calculate the fourth roots of 8(— 
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THE CUBIC EQUATION 

49. Historical and Introductory. Examples of cubic 
equations have recently been discovered by O. Neugebauer 
among ancient Babylonian records. Cubic equations 
were considered by Diophantus about a.d. 300, but the 
first European mathematicians to give a complete solution 
of them belonged to the Italian school of Bologna at the 
time of the Renaissance : they were Scipio Ferro, Nicolas 
Fontana, surnamed Tartaglia (that is, the Stammerer), 
and Cardan (who visited St Andrews in Scotland in 1552). 
The solution which usually bears the name of Cardan, 
who brought it into prominence, is really due to Tartaglia. 
The general cubic equation is 

= 0 . . . . ( 1 ) 

By taking x = y— \a x we can reduce it at once to the 

form 

y z +(a 2 -^a\^y+ ^a\- l -a l a 2 +a 3 = 0. . (2) 

This equation can now be written as 

x^ ~\~dx-\-b = 0 . • • (3) 

which we shall call the reduced form of the cubic. It lacks 
a term in x 2 : otherwise the coefficients are general. 

If a 2 happens to be zero it is quicker to put x = 1/y, 
in order to reach a reduced form of cubic equation. 
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50. Cardan’s Solution of the Cubic. To solve 

x 3 -\-ax+b = 0 . . . (1) 

put x = z-\-v. Then 

x 3 = z 3 -{-v 3 -\-?>zv{z+v)\ m 

= z 3 +v 3 +3zm;. / # 

Equation (2) may be regarded as a cubic equation in x 
and is in reduced form, since there is no term in x 2 . 
Equations (1) and (2) are the same provided that 

z 3 +v 3 — —b, 3 zv = —a. . . (3) 

To solve this pair of simultaneous eqations for z and v , 
write z 3 v 3 = — a 3 /27, so that z 3 and v 3 are the roots of the 
quadratic 

A 2 +W— a 3 /27 = 0, ... (4) 

where the sum of the roots is —b, and the product of the 
roots, namely z 3 v 3 , is —a 3 /27. Hence A = z 3 or v 3 ; 
that is, 

A = -^±V(^ 2 /4+a 3 /27) = z 3 , v 3 . . (5) 

Since z and v have entered the work symmetrically, it 
does not matter which is which : and so we let z 3 be given 
by the positive, and v 3 by the negative, sign. Finally 
we take the cube roots and add, so that 

iS = z+ „=V'{-J6+V(6V4+a 3 /27)}+^{-ife-V(6 i '/4+« 8 /27)} . (6) 
This is the celebrated formula of Cardan (1573). 

Example. ^+3^ + 8 = 0. 

Here x = z+v, z 3 -\-v 3 = —8, zv — —1 and A 2 -h 8 A — 1 = 0, 
so that A == —4 dbV^* 

Thus x = ^(-4+V'n)+^(-4-v'lV). 

Since every number has three distinct cube roots 
(p. 109) we have evidently obtained several values of 
x by Cardan’s formula. This is as it should be, for a cubic 
equation usually has three distinct roots. But at first 
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sight it appears that there are nine, or even eighteen, 
possibilities in the formula, since alternative solutions 
exist for square and also for cube roots. As for square 
roots, a glance shows that the signs are fixed — one must 
be positive and one negative. Accordingly we consider 
the cube roots. 

Let the distinct roots of x 3 — 1 = 0 be 1, co, a> 2 , so that 

l+co+co 2 = 0, a> =—!+»—, O) 2 = — i 

Then (d 3 = 1, and the cubes of z , coz, oj 2 z are each equal to 
z 3 . Similarly for v. Hence there are three -times- three ways 
of taking the formula (6) for x , owing to these combinations 
of the pair of cube roots. But whereas each of the nine 
ways will satisfy the first condition 

z 3 -j-v 3 == —b (e.g. (coz) 3 -f (on;) 3 = —6) 

only three ways will satisfy 3 zv = —a, namely z with v , 
coz with cu 2 v, and a> 2 z with cov. We infer that the three 
roots for x are given by the formulae 

x = z- \-v, coz-\-a) 2 v, o) 2 z-\-(x)V f . . (7) 

where z and v are the cube root expressions of (6). 

A check upon the accuracy of this result is afforded by 
adding together these three roots. The sum is 

(1+m+o >*)(z+v), 
which vanishes since 1 +co +co 2 = 0. 

Example. x 3 — 18a; — 35 = 0. 

Here X = 2+3, 2o;-f3o r 3 , 2ai a -f3ai, 



The sum of the roots is zero. 

51. The Case of Equal Roots. If two of the roots 
are equal we equate a pair of the results 50 (7). For 
example, let z-j-v = (oz-j-co 2 v f so that (1 —co)z = (co 2 —l)v 
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and z = (— 1— a>)u = c o 2 v. From the three possible pair- 
ings of roots we find that when 

z — v or cov or co 2 v . . . (1) 

two roots are equal. Substituting in 50 (3) we have 
2v 3 = —by —a — 3v 2 or 3c ov 2 or 3co 2 v 2 , 
so that 4v 6 = 6 2 , 27v 6 = —a 3 in each case. Hence 

A = 6 2 /4+a 3 /27 = o. . . . ( 2 ) 

The expression on the left side of this equation is called 
the discriminant . Conversely, if this discriminant vanishes 
then z = v or ouv or o> 2 v, as we see from 50 (4), for both 
z and v are in this case cube roots of — \b : and accordingly 
two roots are equal. 

Three roots of the equation x 3 +ax-\-b = 0 are equal 
when and only when a = b == 0, as is seen by making all 
the values (1) equal, so that z — v — 0. In this case the 
unreduced equation is more interesting, and the condition 
for three equal roots is that 

x 3 -\-a 1 x 2 -\-a 2 x-\-a 3 

should be a perfect cube. If so, it must take the form 
(x— a) 3 , and therefore a 1 = —3a, a 2 = +3a 2 , a 3 = —a 3 . 
This requires two conditions to connect the three co- 
efficients : for example, 

a\-\-3a 2 = 0, a x a 2 — 9a 3 = 0. 

The expression A is not only the discriminant of the 
cubic but also of equation 50 (4), which is called the 
quadratic resolvent. We shall now see that it discriminates 
further between real and complex roots. 

52. The Ordinary Case A >0. If the quantity 
A = 6 2 /4+a 3 /27 

is positive the formula 50 (4) gives real values of both 
z and v , so that x = z+v is also real. In this case the 
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other two values of x are (uz+ojH and co 2 z+cjv. But 

oj = -\-\ri > 

hence these values of x are 

—W+v)±\iy/^{z~v) i 

which are necessarily complex, since z—v cannot vanish 
unless A — 0, whereas A has just been assumed to be 
positive. We infer that when A >0 two roots are necessarily 
complex, and one only is real. 

The Irreducible Case, A<0. The above certainly 
suggests that the outstanding possibility, namely the 
case when A< 0, furnishes the criterion for the existence 
of three real roots x , which is indeed the case. In fact, 
if A< 0, then the quadratic of 50 (4) for A has complex 
conjugate roots, giving, let us say, 

z 3~p-j-iq f v z — p — iq } . . ( 3 ) 

where p = —\b and q = y/(—A ), which are both real. 
It is possible to proceed by using Demoivre’s theorem, 
after putting 

p = r cos 9, q = r sin 9 , 


so that r and 9 are real. Then, by Cardan’s formula, 
x = %(p+iq)+ ^(p-iq) 

= ri(cos 6-\-i sin 0)i-fr^(cos Q—i sin 6)i 
-r*(. 


d-\~2JC7T . . Q -\-2tkTT 6 ~f~ 2^7T , . d-{-2JC7T\ 

cos — h^sin 1-cos — ism- 


= 2ri . 


3 

0-\~2kir 


3 3 

, where k is any integer 


3 / 


(4) 


Incidentally we have satisfied the condition 3 zv = —a, 
where a is of course real, by taking the same value of k 
in extracting the cube roots of p+iq and p—iq. In other 
words, z and v are conjugate complex numbers. 
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By taking k = 0, 1, 2 we obtain three distinct values 
of oos and therefore of x, each of which satisfies 


the cubic equation, so that we have discovered the 
requisite roots. Taking further integer values of k merely 
repeats the same values of a?, owing to the identity 


cob(?+&)- 


cos -. Furthermore, these roots are all 


real. Hence if Zl < 0 all three roots are real. 


Conversely, if all three roots are real, then z+v must be 
real, so that z and v must be conjugate complex numbers 
h±ik ; and it is easy to verify that each of the three roots 
as given by z-j-v, cuz-}-a) 2 v, co 2 v-j-a)V is real. Hence 

A = 6 2 /4-fa 3 /27 = \{z 2 -\-v 2 ) 2 — z z v z 

= J(z 3 — v 3 ) 2 = (3 ihk 2 -\-i z k z ) 2 = —(3 h—k) 2 k A . 


This expression is either zero (giving the case already 
treated in 51 of two equal roots) or is negative. Hence 
if the roots are real and unequal then A< 0. 

Once more, in this case, p = — \b, q — so that 

r 2 = p 2j f-q 2 — \b 2 —A == —(a/3) 3 , 


while r cos 6 = p , that is, 


Hence 


cos e = — 6/2\/(— a 8 /27). 


x = 2 



cos 


6~\~2k7T 


k = 0,1, 2. 


( 5 ) 

( 6 ) 


We call this the trigonometrical solution of the cubic 
equation. 

To sum up : if a and h are real the cubic equation 
x 8 +aa;+6 = 0 has 

(i) three real roots when A< 0, 

(ii) two equal roots if A == 0, € 

(iii) one real and two complex roots if A >0. 
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Here A = 6 2 /4+a 8 /27 : it is the discriminant both of the 
cubic and of the resolvent quadratic A 2 +6A— a 3 /27 = 0. 

In case (i) z and v are conjugate complex numbers, 
in case (ii) they are real and equal, in case (iii) they are 
real and unequal. 

Cardan's formula at once yields the real roots in 
cases (ii) and (iii), but, paradoxically, cannot do so in 
case (i) without introducing imaginary numbers. To 
mathematicians of the sixteenth and seventeenth centuries 
this feature was very mysterious : they spoke of the 
irreducible case. The method of Demoivre clears the 
matter up and yields the roots ; but it still remains a 
curious fact that from a real cubic three real roots cannot 
be extracted by Cardan’s algebraic formula without a 
circuitous passage into, and out of, the domain of complex 
numbers. (See 53.) 

Examples. 

1. * 3 + 12:r-112 = 0. (Cf. p. 148, Ex. 11.) 

Cardan’s formula gives 

» = ^{56+ +(3136 + 64)} + {56- +(3136 + 64)} 

= ^112-66854+ ^(—0-56854) 

= 4-828... -0-828... = 4. 

The answer is exactly 4, but the working involves the use 
of tables. For the other roots, divide by # — 4. Then 

» a +4 x+28 — 0, and x = —2±i2\/6, 

2. Solve a? 3 + 3x 2 + 15o; + 25 = 0. 

Put x + l — y, then y z -\-\2y + 12 = 0. 

Here a = 12, 6 = 12, A = 100 and 

y = ^(-6 + 10) + *((—6 — 10) = 1-5874-2-6198 = -0-9324, 

so that x —y — \ — —1-9324. This is the only real root. 
/Complex roots for y are given by 

<u ^4-a>»2 ^/2 and co 1 ^4-a>2 ^2. 
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To evaluate them, call them a+ip, a— ip ; then 
-2a = -0-9324, 

since the sum of the three roots must be zero. Also the 
product of the roots is —12. Thus 2a (a 2 -f-/? 2 ) — 12, or 
p 2 = 6/a — a 2 . Finally a = 0-4662, p = 3-5571 and 

x = y - 1 = a-1 ±ip = — 0-5338 ±3-557 li. 

53. Alternative Trigonometrical Treatment. If 

real roots only are required we may proceed as follows. 
Consider the identity 

4 cos 3 ^— 3 cos (f> — cos 3^ . (1) 

and suppose x*-\-ax-\-b = 0. If these two equations are 
the same, then, on comparing coefficients, we have 

x 3 /(4 cos 3 <f>) — — ax/ (3 cos </>) = —6/ cos 3<£ 
so that — 3x 2 = 4a cos 2 </>, or £ = 2\/(— Ja) cos (j> 

and 6 = 2\/(—a 3 /27) cos 3<^. 

Thus cos 3^ is known in terms of a and 6, whence 
x is known. It leads to the result 52 (6) on writing 
cos !(d+2&7r) for cos </>, and thus it solves the case of 
three real roots. 

Now cos 3 <f> cannot exceed unity : hence 
6<-2V(-a 3 /27), 
so that A = 6 2 /4+a 3 /27<0. 

(i) If, here, A = 0, then cos 3</> = ±1, or = 2&7 t/ 3 
and cos <f> = 1, — J, — J according as lc — 0, 1, 2. This 
gives 

x — 2a, — a, —a 

as roots, where a = \/(—a/ 3). It is the case of repeated 
roots, as we have seen already in 51. 

(ii) If J<0, then 

* = 2v / (— o/3) cos /<£+ & = 0, 1, 2. 
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When, however, A> 0, cos 3 <f> is greater than unity. We 
obtain a real result by assuming 

x = 2y'(— a/3) cosh<£, 6 — 2\/(— a 3 /27) cosh 3 <f>, A> 0, 

since 4 cosh 3 <£— 3 cosh<£ = cosh 3(f). Since cosh 3 <f> is non- 
periodic as a real function, it is impossible to obtain 
further real roots. Indeed, as we already know from 52, 
the other two roots are complex. 


Examples. 

1. x z -3x-\-\ = 0. 

Here a — — 3, b = 1, A = — §<0. 


Take r 2 — — a 3 /27 = 1, cos 6 = — 6/2r = — £, so that 
6 = 120°. Then 


x = 2 cos 

2. Solve 

3. Solve 


120 + 360&\° 

- j = 2 cos 40°, 2 cos 160°, 2 cos 280°. 

* 3 -27s + 27 = 0. 
a; 3 — 6# — 9 = 0. 


54. The Equation of Squared Differences of the 
Roots. Let a, /8, y be the roots of the cubic equation 
x*+ax+b = 0, and let us seek an equation whose roots 
are (£-y) 2 , (y-a) 2 , (a-,8) 2 . 

We take y = (y3 — y) 2 = a 2 +j5 2 +y 2 — 2a/?y/a — a 2 . 

But, by 28 (3), a+/3+y = 0, /3y+ya+a/3 = a, ajS y= —6, 
so that a 2 +/3 2 +y 2 = (a+/?+y) 2 — 2(/3y+ya+aj3) = —2a. 

Therefore y — —2a +26/ a— a 2 = (26— 2aa— a 3 )/a. 

But a 3 +aa+6 = 0. 

Hence (i/+a)a = 36 or a = 36/(y+a). 

On substituting this value of a in the cubic we have 

t/ 3 +6ay 2 +9a 2 ?/+4a 3 +276 2 = 0 . (I) 

Now one root of this new cubic is y = (/J— y) 2 . Hence, by 
symmetry, the other roots are (y— a) 2 , (a— jS) 2 . Also, the 
•product of these three roots is — (4a 3 +276 2 ), which is 
-108 A. 
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Thus (j8-y) 2 (y-a) 2 (a-£) 2 = -108 J . (2) 

This is an important result, for it expresses the 
discriminant A explicitly as a symmetric function of the 
roots. It verifies (i) that A vanishes when, and only 
when, two roots are equal, (ii) that zJ<0 when all the 
roots are real and unequal, and (iii) that A > 0 when two 
of the roots are complex conjugate. 

55. The General Cubic Equation. Let us now 

consider the general cubic equation 

a 0 a; 3 +3a 1 a; 2 +3a 2 a;+a3 = 0, . . (1) 

where for convenience in what follows the binomial 
coefficients 1, 3, 3, 1 are inserted. If z = a 0 x+a ly then 

z 3 +3(a 0 a 2 — a 2 )2+ a o a 3“ '^ a o a i a 2+^«f = 0 . (2) 

as is easy to verify. It is usual to write 

H = a Q a 2 —a O = 3a 0 a 1 a 2 +2aJ . (3) 

If a, /?, y are the roots of the cubic in x, then the roots 
for z are evidently 

a o y~H*i • • • (4) 

But a 0 a + a i“( a o/?+ a i) — a o( a ~P)> which is the difference 
of the roots for z. Also the preceding result 54 (2) applies 
to the equation for z, since this lacks a term in z 2 ; and 
we shall have 

^(i8-y) 2 ^(y-a) 2 ^( a-jS) 2 = -108J, . (5) 

where 

A = 6 2 /4+a 3 /27 = G 2 /4+27tf 3 /27 = £(G 2 +4H 3 ) . (6) 

Hence, in terms of the roots a, /?, y of the general 
equation 

a 0 a; 3 +3a 1 a; 2 +3a 2 a;+a3 = 0, 

we have , 

2 (y— a ) 2 (a— jS) 2 = -27(G 2 +4tf 3 ) = -108 A . (7) 
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This exhibits the discriminant A in a more general 
light : it is called the discriminant also of the more general 
cubic, and the characteristic properties remain true of this 
cubic according as A > = <0. 

Since 3// and 0 have taken the place of a and 6 in the 
original treatment, we can sum up as follows : 

The general cubic has three unequal roots if <2 2 +4 J ff 8 <0, 
has one real and two complex roots if (r 2 +4/f 3 >0, and 
two equal roots if G 2 -\~4H 3 = 0. 

It has three equal roots if G — H = 0, for in this case 
we find at once that aj a x = aja 2 = a 2 /o 3 , and the cubic 
in x becomes (a^+Gq) 3 = 0. 

56. The Canonical Form of a Cubic. The co- 
efficients a { of a cubic 

CIqX^ -\~^C lyX 2 -\-^CL 2 X-\-d^ . • (1) 

may be regarded as the first four coefficients of a recurring 
series 

S = +••• J t a rf' n ~K«‘> • (2) 

where a n — wun-i+^n - 2 • • . (3) 

is the scale of relation (23), p. 53, and 

d 0 v+a 1 u—a 2 = 0, a 1 v+a 2 u—a 3 = 0 . . (4) 

are the initial equations which give u and v. Thus 

— . . (5) 

a Q a z -’d 1 a 2 a x a z —a^ a z a 2 —a\ 

By the usual procedure (p. 54) the sum S is given as 

S = {a 0 +(a 1 —ua 0 )t}/(l--ut--vt 2 ) . . (6) 

which breaks up into partial fractions 

P , <L_ 

l+o* ^ 1+p t 


• ( 7 ) 
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when a^/J, that is, when w 2 +4t;^0. Here a, j3 are 
given by the identity 

(l- tt< ^2) =5 ( 1+<rf )( 1+ ^) > . . (8) 

so that a and )8 are the roots of the quadratio 
x 2 +ux— v = 0, 

or (a 0 o 2 -af)x 2 +(a 0 a 3 -a 1 a 2 )a:+(a 1 a 3 -o|) = 0, . (9) 

which is called (Aitken, Determinants and Matrices , p. 130) 
the Hessian of the original cubic (1). The reader may 
verify that the Hessian has equal roots when A vanishes. 
The Hessian can be written also in the alternative forms 

a Q x a-^x ~ J - a 2 cq a 2 

= 0, cq a 2 a 3 = 0. . (10) 

a 1 x+a 2 a 2 x-\-a 3 1 —x x 2 

On expanding the series for (7) and comparing the result 
with the series 8 , we have 

p+q = a 0 , pa-\ qfi = — a lf pa* \ qfi 2 =a % , pa*-\ qp* = —ct 3 . ( 11 ) 

Multiply these by a; 3 , —3a: 2 , 3a;, —1 respectively and add; 
then 

p(x— a) 3 +q(x— jS) 3 == a 0 a; 3 +3cqa; 2 -\-%a 2 x-\-a z . . (12) 

Since p and q are known by the method of partial 
fractions, we have thus expressed a cubic as the sum of two 
cubes of linear forms. 

This is called the canonical form of the cubic, and the 
reduction to such a form is possible, whenever the roots of 
the Hessian are distinct, that is, when A =^0. 

The result is due to Sylvester, who showed that the 
same method would apply to any form of an odd order, 
cubic, quintic, septimic and so on. For a quintic 

Q = a Q x^+^a 1 x^-\-\()a 2 x z +\0a 2 x 2j r6a l x J ra b . (13) 

we take a recurring series 8 = Ea n t n as before, but with a 
scale of relation 

a z = a 2 M+<qt?+a 0 w, a 4 = a^u+a^+a^w, etc., . (14) 
where (1 —ut—vt 2 —wt z ) = (l-fa£)(l+jS£)(l+y£) . (15) 
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and 


8 


— — = Za n t». 

1 +7^ n =0 


1+a t 1 1+pt 
The quintic is readily found to be 

Q = P(x-a) 5 +q(x— p) 5 +r(x—y)\ 

where a, /?, y are the roots of the cubic 
x z -\-ux 2 — vx-\-w = 0, 


or 


a^x-\-a x a x x-\-a 2 a 2 x-\-a 3 
a x x-\-a 2 a 2 x-\-a 3 a z x+a A 
a z x-\~a A a^x-\-a b 


= 0, or 


• ( 16 ) 


<Xq 

a i 

ci 2 

« 3 


a x 

a 2 

a 3 


= 

d 2 

a z 


«5 


1 - 

-X 

X 2 

-a; 3 



0. (17) 


The determinant, of order n when the original form is of 
order 2n— 1, is called the canonizant of the form. The 
method also applies when the roots of the canonizant are 
repeated : we then modify the partial fractions in the usual 
way and proceed as before. 

Examples. 

1. If, for the cubic, a = 


then 


+ 


1+a t (1+aO 2 


and the canonical form is 

r(x — a) z +s(x~ a) a , where r = p-\-q f s = —3 qa. 


2. Reduce a; 3 — 3a; + 2 to canonical form 

(a = 0 = 1 ; (a— *l) 8 + 3(a? — I) 2 ), 


3. Reduce 9# 3 — 6;r 2 + 18a;+7. 

4. If a , /?, y are the roots of a cubic and a> is a complex 
cube root of unity, prove that the roots of the Hessian are 

£y + tuya + a> 2 a/3 /?y + tu 2 ya + ct>a/J 

a+a>j3+a> 2 y a+6o 2 )3+ct>y 


I 
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THE BIQUADRATIC OR QUARTIC EQUATION 

57. The Quartic Equation : Feasibility of a 
Solution by Radicals. The equation 

/ =zf(x) = a 0 a; 4 +4a 1 a; 3 +6a 2 a; 2 +4a3a;+a 4 = 0 . (1) 

is called a biquadratic or quartic equation. In general it 
has four roots a, /?, y, 8 which can be found algebraically 
by reducing the equation to a cubic equation in rational 
steps. Since it is known that equations of degree higher 
than the fourth cannot be so reduced to lower equations 
it is of interest to enquire why a quartic admits such a 
reduction. The underlying reason is simply this : that 
if a cubic equation 

z*+pz*+qz+r = 0 . . . (2) 

is formed, the roots of which are 

/fy+a 8, ya+£S, a/?+yS, . . (3) 

then each of its coefficients is a symmetric function of 
a, j3, y, 8* (This is easily verified : for instance the sum 
of the three roots is Zfiy.) Accordingly, p, q, r are known 
rational functions of the original a 0 , a ly a 2 , a 3 , a 4 . But 
the cubic can be solved ; so that fiy+ah is known. Also 
apy8 = aja 0 . Thus we can obtain fiy and aS as roots 
of a quadratic. Similarly for all six binary products ft y. 
Hence each ratio a/J : ay and therefore : y is known, 
from which we can find not merely the ratios of the 
roots but the roots themselves by using the fact that 
Za == Icq/ a$. 

#£. 

68. Ferrari *s Method of Solution. Let the quartic f 

no 
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of 57 (1) be multiplied by a 0 and then expressed as the 
difference of two squares 

(a 0 x 2 -j-2a 1 x+a 2 +2a 0 d) 2 — (2Mx-{-N) 2 , . . (1) 

where, it will be seen, the coefficients of x 4 and x 3 agree 
with those of the original, multiplied by a 0 . Comparison 
of the further terms in x 2 , x 1 and x° gives us the relations : 

M 2 = a\— a Q a 2 -\-affl> MN = a x a 2 ~ a^+SaQa^, 

N 2 = (a 2 +2a 0 9) 2 —a 0 a A . . . (2) 


Eliminating M and N by squaring MN and equating the 
result to M 2 N 2 , we find that 


4a^ 3 — a Q Id+J = 0, 

where 


1 = a 0 d x — 4a 1 a 3 +3a|, 


J = 


&q & x CL 2 
d x CL 2 flg 
CL 2 d 2 d x 


(3) 

(4) 


The equation (3), which is usually called Euler’s reducing 
cubic , is typical of the reduction ; for whatever algebraic 
method of solution for the quartic is attempted, sooner 
or later such a cubic resolvent is bound to arise. 

Let Uy Vy w be the roots of the cubic. From one such 
root u we at once find the values of M 2 and MN uniquely. 
Hence M and N are known, apart from a sign which is 
immaterial, since the result gives us simultaneously the 
quadratic factors 

a 0 x 2 -\-2d 1 x-\-a 2 -\-2dQ6±{2Mx-\-N) . . (5) 


of the quartic d 0 f. On solving these quadratics we obtain 
the four roots a, /J, y, 8 of the original quartic. 


Examples. (See also Example 5, p. 139.) 

1. Solve x * + 6z 2 +$x + 21 = 0. 

Here Qq == 1, a x = 0, o> 2 == 1, cfj == 2, cl x == 21, I ~ 24, 
J = 16 and 0 s — 60+4 = 0, which is solved by 0 = 2. The 
uquation now becomes 

(a? 2 -f 1+4) 2 — (2Maj-fN) 2 = 0, 
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giving M ±1 and N = z f 2. Thus we have * 2 4-2*4"3 
and x 2 — 2x -f 7 as factors. 

The roots are — 1±*V% l±i\/6. 

2. Solve the equations 

(i) x*-4x 2 +9x + 4 = 0, (2ikf = -22/ = 3) 

(ii) * 4 4~6* 3 4~14* 2 4-15* 4-4 = 0. 

Obviously the quadratic equations obtained by 
factorizing the quartic in the form (1) above can themselves 
be factorized in terms of the roots of the quartic, so that 
we shall have 

a Q x 2 +2a l x-\-a 2 J r 2a (s u-\-2Mx-\-N = a 0 (x—a)(x—f}), (6) 

a Q x 2 -{-2a 1 x-\-a 2 -{-2a 0 u—2Mx—N = a 0 (x—y)(x—&), 

in order that the product of the two quadratics may be 
a*II(x— a) or a 0 f. Since the four factors x—a may be 
paired in three ways, as (a/?, yS), (ay, jSS), (aS, /3y), we have 
another explanation of the existence of a cubic equation 
for d. The triple pairings will correspond to the three 
possible values u , v, w of 9 when substituted in (6). 

By comparing coefficients of powers of * in (6) we have 

== — ciq(cl ft) , &2 ~\~2clqIi = (i^aft, 

2a 1 —2M = — a 0 (y4-S), a 2 -{-2a 0 u— N = a 0 yS, 

so that a 0 (aft-\-y8) == 2a 2 4-4a 0 w, 

Similarly, a Q (ay+ftb) = 2a 2 4-4a 0 t;, 

a Q (aS+fty) — 2a 2 4-4 a 0 w. . . . (7) 

Since 6 2 is absent in the cubic u+v-\-w must vanish. 
Hence 2(a/?4~yS) — (ay+ftb) — (a8-\-fty) = §u—4v—4w=\2u, 
or 12^ = (a — S) (ft — y ) 4~ (a —y)(ft —8). . . (8) 

This gives a root u of the cubic explicitly in terms of those 
of the quartic. Analogous formulae hold for v and w, 
namely, 

12v = (a-i3)(y-S)+(a-8)(y- i 8) ) 

12w = (a— j3)(S— y)+(a— y)(S— j3). 
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By subtraction we find that 

4 (u—v) = (a— 8)(£— y) . . . (9) 

and two similar expressions for v— w, w—u. Hence, by 
multiplying all three results of this type together, we 
have 


64 (v—w){w—u)(u—v) = (a /?)(y S)(a — y)(S — fi) (10) 

When u = 0 the four numbers a, /?, y, 8 are said to be 
harmonically separated , and the relation is often written as 


{a/J, yS} = 


(a-y)(^-8) 
(a S)(P y) 


= - 1 . 


( 11 ) 


In this case one root of the cubic vanishes : hence J — 0. 
Conversely, if J = 0 the four roots of the quartic form 
harmonic pairs. 

If a — then v — w, as is seen at once from (7). 
Hence both quartic and cubic have repeated roots. By 
51, p. 120, the condition for this is that 

A = / 3 — 27J 2 = 0. . . . (12) 

This expression A is called the discriminant of the quartic. 
It will now be proved that 


§) 2 = 256 J, (13) 


For by 54, p. 126, we have 

II(u-v ) 2 = 108(/ 3 /27— J 2 )/64a® 

= (/ 3 -27J 2 )/16ag 
— A/lOaQ. 

Hence, by (10), 

a«n(a-p) 2 = a* 64 2 /7(^-i>) 2 = 256 A. 

Example. For x* — 1 = 0 we find I = — 1, J = 0, 
403 q_0 — o, and the roots of the quartic are ±1, ± L i t while 
those of the reducing cubic are 0, ±4^- 

* The reader can prove from (13) that A< 0 is the 
condition for two real and two complex roots, A> 0 is 
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that for four real or four complex roots. (Cf. Burnside 
and Panton, Theory of Equations (1899), vol. 1, p. 145.) 

59. Geometrical Aspect of the Quartic Equation. 

Let us consider the locus x = t 2 , y = 2t. It is the para- 
metric form of a parabola S whose equation is y 2 = 4x. If 

S' = ax 2 -\~2hxy-\-by 2 -\-2gx-\-2fy-\-c = 0 . (1) 

denotes another conic, then the points common to the 
two curves S and S' are given by 

at*+2ht z +4bt 2 +2gt 2 +4ft+c = 0 > . . (2) 

which is a quartic equation for t. It has in general four 
solutions a, /?, y, S, giving four points A = (a 2 , 2a), and 
so on, all of which lie on each conic S and S'. 

Now consider the conic 

F = a 0 x 2 +2a x xy-\-a^y 2 -\-2a 2 x-{-2a^y-\-a^ = 0. . (3) 

It meets the parabola where 

a 0 £ 4 +4a 1 £ 3 +6a 2 £ 2 +4a 3 £+a 4 == 0. . . (4) 

Also the points A, B, C, D common to F and S lie on the 
conic 

O = F-a o 0(y 2 -4x) = 0 . . (5) 

for all values of 0. The discriminant of this conic vanishes if 

a 0 a x a 2 +2a 0 9 

&2 ®q9 ^3 *— — 0, . • (6) 

a 2 +2a o 0 a 3 a 4 

as is at once apparent by writing out F in full and re- 
arranging the terms. On expansion this condition turns 
out to be the reducing cubic 

4a o 3 0 3 -a o J0+J = 0. . . . (7) 

The three roots of this equation give three values of 0 
for which the conic O degenerates into two straight lines ; 
and, since the lines must pass through the four points 
A, B, C, D, they can only be the pairs AB, CD ; AC, BD ; 
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AD, BC. In either case 0 resolves into factors linear in 
x and y , or quadratic factors in t. But when x — t 2 
and y = 2t are substituted in the expression 0 the terms 
involving 6 disappear, so that 0 takes the same form as 



F ; that is, 0 becomes the quartic in t. Hence we have 
factorized this quartic into two quadratic factors by 
choosing 6 to satisfy the reducing cubic. 

60. Canonical Forms of Even Order. Suppose we 
have 

a 0 v+a 1 u—a 2 — 0, a l v-\-a 2 u—a 3 — 0, a 2 v-{-a z u— a x = 0. (1) 
Then the coefficients a { of the quartic 

a 0 x 4 +4o 1 a; 3 +6a 2 x 2 +4a3X*+o 4 . . (2) 

satisfy the recurrence relation (1), which is the recurrence 
relation used also in the case of the cubic in 56. These three 
equations for u and v can be simultaneously true only if 
a Q a t d 2 

* «7 = cq a 2 = 0. * , • (3) 

a 9 a z a x 
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In addition to all the results of 56 (11), p. 128, we now 
have pa 4 +gj8 4 = a 4 . Hence 

a 0 a; 4 +4a 1 a; 3 +6a2a; 2 4-4a3a:4-^4 = p{x—a) l -\-q{x—\ 8) 4 . (4) 

This shows that when J vanishes, the quartic is expressible 
as the sum of two perfect fourth powers, and conversely. 

The determinant | a 0 a 2 a 4 ...a 2n | of the 2n-ic form 
a 0 x 2n +2 na^x 2n ~ x -f (2 n) ( 2) a 2 x 2n “ 2 + . . . +a 2n . (5) 

is called the catalecticant of the form. In the writings of 
Cayley and Sylvester such a form was often denoted by 

(a Q , %, ..., a 2n \x, l) 2n ... (6) 

If the catalecticant vanishes the form, or quantity is 
expressible as the sum of n perfect 2nth. powers of linear 
forms x—a , each with a constant coefficient, where a is 
a root of the equation 

| a 0 a 2 ...a 2n _ 2 (-x) n | = 0, . . (7) 

the last row in this determinant being understood to be 
1, —x, x 2 , ..., (— x) n y the first row being a Qi a v ..., a n . 

Next consider the quadratic 

F = )#-f-6-|-A6*. • (8) 

It is a perfect square when 

(a+Aa')(6+A6 , )-~(^+M , ) 2 = 0, . . (9) 

where (9) is a quadratic for A in terms of the other quantities. 
It follows in general that, for two values A x and A 2 of A, 
we can write the quadratic F in the form p(x—a) 2 . 
Accordingly, let 

/ = ax 2 -\-2hx+b i /' = a'x 2 +2h'x+b ', 
and f+\f = p(x-a) i , f+\ 2 f = ?(*-£) 2 - • (10) 

On solving these two relations for / and/' we obtain 
/= r(x-a) 2 +s(x- yS) 2 , /' = r’(x-a) 2 +s'(x-B)\ . (11) 
where r, a, r', a' are suitable constants. 
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Again, since a quartic can always be resolved into 
two quadratic factors / and/', in three ways, we can reduce 
such a pair of quadratic forms to the present form, and 
thus express their product, the quartic, as 

A(x-a)*+§H(x-a) \x-p) 2 +J3(z-j8) 4 , . (12) 

where A = rr' } 6H = rs'+r's, B — ss'. 

This is the usual canonical form for a general quartic 
— the sum of two fourth powers and the product of their 
squares, with constant coefficients. Evidently our earlier 
case, where J was zero, corresponds to the possibility that 
the coefficient H may vanish. 

This canonical reduction of the quartic typifies also 
the reduction for the general 2n-ic ; namely, that the 
general 2n-ic can be expressed as the sum of n perfect 
2?ith powers of linear forms, affected by constant coefficients, 
together with a term involving the square of the product 
of these forms. 

61. Gregory’s (1675) Method of Solving a Cubic 
or a Quartic Equation. Let us consider the equation 

x z -\-qx-\-r — 0. 

Put x = z+v, 

so that v 8 +3zt; 2 + (3z 2 +g)v+z 3 +gz+r = 0. . (1) 

Now multiply this cubic by the arbitrary cubic 

v z +av 2 -\-bv-\-c . . (2) 

and in the resulting sextic for v equate to zero the co- 
efficients of v 6 , v 4 , v 2 , v. 

Then 3z-f a — 0, 3z 2 +g+3za+6 = 0, 

az 8 +agz+ar+36z 2 +6g+3zc = 0, 

6z 3 +6gr+fo*+3z 2 c+gc __ q (gj 

Solve these equations for a, 6, c in succession. 

Then a = — 3z, b = 6z 2 — g, and c = (g 2 +3zr— 15z 4 )/3z. 
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Substitute in the final equation and we find that 

27z*—27z 3 r—q 3 = 0. . . (4) 

This is a quadratic for z 3 , whence z is found, and, with it, 
a, b and c. The sextic for v is of the form v 3 +Av 3 -\-B = 0, 
which can now be solved. Finally, x = z+v. 

Next consider the quartic xt-\-qx 3 +rx+s = 0, with 
x = z+v, so that 

+ 2gz+r)v-f-2 4 -4-g2 2 +rz-f« = 0. (5) 

Multiply this by the arbitrary quadratic v 2 +av+6, 
choosing a and b such that the resulting sextic in v may 
have no term in v 5 or v 3 or v. 

Then 4z+a = 0, 4z 3 +2<7z+r+6z 2 a+ga+46z = 0, 

a(z 4 +^z 2 +rz+5)+6(4z 3 +2gz+r) == 0. . (6) 

Hence a = — 4z, b = (20z 3 +2gz— r)/4z 
and 64z 6 +32gz 4 +(4g 2 — 16s)z 2 — r 2 = 0. . . (7) 

Solve this cubic for z 2 , whence z, a, b are found. Next 
solve the sextic for v , which is of the form 

v 3j rAv l -\-Bv 2 +G = 0 , 

and so is a cubic in v 2 . Finally, x = z+v. 

Next, the quintic equation, £ 6 +£a; 3 +rx 2 +sa;+$ = 0. 
Gregory put x = z+v as before, and multiplied the quintio 
in v by an arbitrary form of degree 15. 

He equated every coefficient in the resulting 20-ic to 
zero, except those belonging to v 20 , r 15 , t; 10 , v 5 , v°. This 
gave him sixteen equations for the fifteen unknown co- 
efficients a , b y ..., k and for z. If they could be solved , 
then v 5 would depend on a quartic and the value of x 
would be found. 

Death intervened (in 1675) before he had completed 
the full investigation. Actually the eliminations would 
have led to a sextic equation for a power of z, but the 
matter was not cleared up until the time of Abel early 
in the nineteenth century. So sure was Gregory that he 
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had discovered the general algebraic method of solving 
all equations that he was in correspondence with his 
friends Collins and Dary, arranging for the calculation of 
the eliminations for all equations up to the tenth degree ! 

Abel proved that the resolvent of any equation of 
degree n>4 was an equation of degree higher than n ; 
and this epoch-making result established the impossibility 
of solving general algebraic equations of the fifth or higher 
degree by rational processes and explicit root extractions 
in a finite number of steps. 

Examples. 

1. Solve (i) 2a?— 14a; 2 + 8 = 0. 

(ii) 6a; 4 — x 2 -x-\- \ = 0. 

2. If a+j8 = y+8 for the general quartic, prove that 

— 3« 0 nr 1 a 2 +2a3 = 0. 

Solve a; 4 — 8a; 3 -f 8a; a -f 32a;— 44 = 0. 


3. Solve 

X 

a b c 


X 1 

. 

. 


a 

x . . 

= 0, 

1 2x 

1 



b 

. x . 

. 1 

2x 

1 


c 

. . X 


. . 

1 

2x 


The latter determinant is cos 40 if x — cos 0 ; hence 
rn 

x = cos — «, r = 1, 3, 6, 7. Generalize. 

8 

4. Express the discriminant of a cubic or a biquadratic 
as a determinant in the s i (p. 80). 

5. The following alternative form of Ferrari’s method is 
useful in solving numerical examples. 

Solve a; 4 + 4a; 3 -f 8a; 2 -{-7# +4 = 0. 

Let this equation be written 

(x 2 +2x +t) 2 = (aa;-f&) a . 

Then, on comparing coefficients, 

2t—a 2 = 4, 4t — 2ab = 7, t 2 —b 2 = 4. 

Whence 4(2t-4)(* 2 -4) = 4a 2 6 2 = (4*-7) a , 

|o that 8* 3 — 32^+24^ + 15 = 0. 

A rational factor of this cubic is 2t—5, yielding t = 2J, 
= and leading to rational quadratic factors 

(a; 2 -f-a; + 1 )(a? a + 3a; + 4) of the quartic. 
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ELIMINATION OF THE UNKNOWN FROM 
CONSISTENT EQUATIONS 

62. Dialytic Elimination. From n separate equations 
it is generally possible to eliminate n— 1 unknowns. The 
following, which is called by Sylvester the dialytic method, 
provides a systematic means of eliminating one unknown 
from two equations. From the reverse point of view 
the vanishing of the eliminant may be regarded as the 
condition under which the two equations in question 
possess a common root. 

Consider the case 

ax 3 +bx 2 +ex~\-d = 0, px 2 +qx-\-r — 0. . (1) 

Multiply the cubic by x } and the quadratic by x and also 
by x 2 y so that five relations are obtained : 

ax*-\~bx 3 -\-cx 2 -\-dx — 0, 

ax 3 -\-bx 2 -\-cx+d = 0, 

px 2 +qx-\-r = 0, . . (2) 

px 3 +qx 2 -\-rx = 0, 

px x -{-qx 3 -\-rx 2 = 0. 

If we treat these as five linear equations, homogeneous 
in x 4 , x 3 , x 2 , x, 1 , the condition for their consistency is, 
by the theory of linear equations (Aitken, Determinants 
and Matrices } p. 64), 

abed. 

.abed 

|i?| == . . p q r = 0. . . (3) 

. p q r . # 

p q r . . 
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Since the term involving a 2 in the expanded form of this 
determinant \R\ is easily seen to be — aV 3 , the expansion 
does not vanish identically. But it is free from x : hence 
it is the required eliminant, or resultant . 

Similar methods apply to a pair of equations of degrees 
m and n : corresponding to the m-ie there are n rows in the 
determinant, corresponding to the w-ic there are m rows. 
In all there are m-\-n rows. This type of determinant 
is usually called (Aitken, Determinants and Matrices , p. 125) 
a bigradient. 

From three equations, consistent in two unknowns x and 
y , we could eliminate x from the first and second, and again 
from the first and third equation, giving two resultants 
containing y , from which y could be eliminated by a like 
procedure. 

63. Factorized Form of the Resultant. Suppose 
that we have 

ax s ~\-bx 2 +cx-j-d = a(x—a)(x—f3)(x—y) =/(r), 

px 2 +qx +r = p{x—\){x~ii) — <f>(x). * ^ 

Then, when both f(x) and (f)(x ) vanish simultaneously, one 
of A, n must equal one of a, /J or y ; and so the following 
relation must exist : 

| jR' | = ( A —a) (A — £) (A —y) {p —a) {fi — £) (fx —y) = 0. (2) 

This can be written in terms of alternants (p. 47) as 

A(\ixaPy)IA(\p)A(afiy) = 0, . . . (3) 

or again, since /(A) = a(A— a)(A— y), and so on, 

pW\R’\ = p*f( X)f(p) = aV(a)^W(y) = 0. . (4) 

We have evidently hit upon several alternative forms of 
fesultant for / and <£, and the question arises, how are these 
related to the bigradient |#| ? The answer is as follows : — 



Ui 


Elimination 


By row-into- column multiplication we have 


abed. 


a ! 4 /3 4 / A 4 p* 


. • • A/a iifjjL 

.abed 


a 3 f} 3 y 3 X 3 p 3 


fx fn 

. . pqr 


a 2 p 2 y 2 AV 2 

— 

0a 0£ 0y • 

. p q r . 


a /? y A /i 


a 0 a £0j3 y0y • 

pqr.. 


11111 


a 2 0 a ^ 2 0/3 y 2 0y • 


where 0 a ==0( a), etc. On taking determinants we 

have 

\R\\R'\A(af3y)A(Xf,) = -Mrf'fipfrAiapyMln). . (6) 

But |-R'| = /A //i/fl 2 = <]>a<l>P<f>ylp 3 . 

Hence |/J||.R'| = —a 2 p 3 \R'\ 2 , or |J?| = — a*p 8 |.R'|. . (7) 

The alternative forms of the resultant |7?| = 0, \E'\ == 0 
are therefore clearly equivalent, as was pointed out by 
Professor E. T. Whittaker (Proc. Edin. Math. Soc. 3 Series 1, 
vol. 40 (1922), pp. 62-63) who gave a proof substantially 
the same as the above. 

64. B6zout’s Condensed Eliminant. Inl779B6zout 

gave a method which produces an n-rowed determinant 
as the eliminant of two equations of degree n. For 
example, consider the case n = 3 for two cubics 

f(x) = a 0 x 3 +a 1 x 2 +a 2 ^+« 3 = 0, 

<f>(x) = b 0 x 3 -\-b 1 x 2 -\-b 2 x-\-b z = 0. 

Multiply these two equations successively by 

b Q and a 0 respectively, 
b 0 x~\-b x and 

b ( p 2 -\-b 1 x-\-b 2 and a 0 x 2 -{-a 1 x-\-a 2J 

and subtract each time the products so formed, 
results are the three following equations : 

KM** + KM* + KM = 0. 

KM** + (KM + KM)* + KM = °» 

KM* 2 + KM* + |«2& 3 ! = 0. 


• ( 1 ) 


Then the 


• (2) 
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where represents the determinant , and so on. 

b 0 Oj 

By eliminating from these three equations x 2 > a; as distinct 
variables, we obtain the resultant as a three-rowed 
determinant equated to zero, thus, 


\ a 0 b l 1 

I^O^I 

l a o^3l | 


Mai 

Msl + Wal 

CO 

II 

© 

. (3) 

\ a 0 b 3 \ 

|ai6 3 | 

| a 2^3l 1 



This determinant is called B^zout’s eliminant, sometimes 
the Bezoutian or Bezoutiant. The case for a general value 
of n is constructed in similar fashion. 

65. Relation between the Dialytic Eliminant and 
B6zout’s Eliminant. Taking first for illustration the 
case m = n = 3, let us consider the bigradient resultant 
matrix 

CIq di CL 2 

. ( Iq cq d 2 ^3 

R = .. a 0 a ± a 2 a 3 # # (1) 

. . b 0 b x b 2 b 3 

• b 0 t>i b 2 b 3 

b 0 b x b 2 b z . 

Premultiply R (in ordinary matrical, or row into-column, 
multiplication) by a suitably partitioned matrix 

1 

1 . ... 

1 ... 

. . 

b 0 °h a o 
b 0 b 2 a o 
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The result is 


KE 


Uq % 

Cl 2 

«3 

. 

. 

. (Xq 

a l 

d 2 

03 

. 


a o 

<h 


0 3 


. 

Mil 

|a 0 ft 2 | 

|M3 


. 

|a 0 6 2 | 

K 6 3 |+I«AI 

Ms 

. 

. 

IM 3 I 

l a AI 

|a 2 6 3 


( 3 ) 


The determinant \K\ of K is evidently a^, and the deter- 
minant \KK\ is likewise evidently multiplied by the 
three -rowed determinant of the elements in the lower 
right-hand corner. Hence, by taking determinants of 
both sides and cancelling we have 


\B\ 


Mi I 
Mai 

| ^0^3 1 


|a 0 6 2 | 


Mai + Mai 
Mai 


iMa I 
Mai 

\dJ)o 


w 


which shows that the bigradient and the Bezoutian 
eliminants are identical in value. 

The elements of the Bezoutian contain minors of the 
2nd order and of weight (in the (i, ^th element) i+j— 1, 
these minors being taken from the rectangular two-rowed 
matrix 


a 0 a ! a 2 ... a n 1 
6 0 b z ... 


( 5 ) 


this last matrix being most obviously connected with the 
identity 


(6) 


|"a 0 a 2 ... a n 1 

a" 



f°l 

K b i 6 * - b A 

a” -1 


|oJ 


a"' 2 




i 




where a is a common root of the associated equations whi&h 
give rise to the eliminant. 
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66. Case of Quantics of Unequal Degrees. As for 

the case when m>n, it is easily derivable. In the trans- 
formation KR of 65, put b Q = 0, and then alter [6 X b 2 63] 
to [6 0 b x b 2 ]- This gives 


Q/q d X d 2 d 3 . 


b 0 b x b 2 

• « 0 a l 



. . b Q b x b 2 

= 

d 0 b x d 0 b 2 +\a x b x \ \d x b 2 \ 

. b Q b x b 2 . 



^0 ^1 ^2 • 


dffi 2 {&1&2I 1^2^21 


(i) 


where the elements in the second and third rows are to be 
regarded as minors of the array or matrix 


% ^3 

b 0 b x b 2 


( 2 ) 


If we introduce a factor a 0 and write the top row of 
the eliminant as [a 0 6 0 , a 0 b X) a 0 b 2 ] the result is more 
symmetrical, while it still gives a genuine eliminant, since 
a 0 ^ 0. 

For the general case, where m—n = r, we take the 
fundamental array to be 

[ a 0 a i a 2 ••• a r a r + 1 ••• a m\ / q \ 

- M - - 131 

by putting the first r coefficients b f in 65 (5) equal to 
zero and renaming the suffixes of the remaining bj so that 
they become b Qy b x , ..., 6 n . 


Examples. 1. Obtain the dialytic and the Bezoutian 
form of the eliminant of (i) two quadratics, (ii) two quartics, 
(iii) a quadratic and a quartic. 

2. If f(x) and g{x) are polynomials, prove that 

{f(x)g(y) -J(y)g(x))Kx-y) 


is also a polynomial. 

, 3. Write out this derived polynomial of Ex. 2 in powers 

of y, when f (x) is a cubic and g(x) a quadratic. By equating 
coefficients of powers of y to zero, and then- eliminating 

K 
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powers of x in the results, obtain the Bezoutian eliminant 
of f{x) and g(x). 

4. Derive on the model of Ex. 3 an alternative method 
for obtaining the Bezoutian eliminant of two equations. 

67. Elimination and the G.G.M. Process. Still 

another way of obtaining the resultant |i?| of two poly- 
nomials f(x) and <f>(x) is to perform the Euclidean G.C.M. 
process upon them. For example, if f(x) and <f>(x) are the 
cubics given in the equations 64 (1), the process begins 
with dividing <f>(x) by f(x). To avoid fractions, multiply 
by the non-zero constant a 0 : then the first quotient 
is b 0 and the remainder a quadratic whose leading term 
is \cLqWx 2 . Before the next division multiply the dividend 
by |a 0 6 x |, again to avoid fractions, and continue in the 
same way. If / and <j> have no common factor involving 
x the remainder will be a constant \R\ t a polynomial 
expression in the coefficients cq and b t . As in 17, p. 38, 
we shall have an identity 

Af+B<f, = \R\. . . . (1) 

Now if the equations 64 (1) are simultaneously true 
when x — a, then f = <f> = 0 at this value of x and hence 
\R\ = 0. But \R\ is independent of x: hence \R\ = 0 
for all values of x and is in fact the same resultant as 
before. Conversely, if |i?| does not vanish it is impossible 
for / and <f> to have a common factor. 

This method gives the necessary and sufficient condition 
for f and (f> to have not only one root in common but also 
k roots in common. For if the intermediate remainder 
which is of degree k in x does not vanish identically while 
the next, which would usually be of degree k— 1, does 
vanish identically, then the G.C.M. is of degree k t and 
exactly k roots are common to / and <f>. The condition is 
therefore given by equating each of the k coefficients of 
powers of a; in this remainder to zero. 

It is interesting to know that all these remainders can 
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be given in dialytic form through a formula due to Cayley 
(1848). For two cubics, 64 (3), it is 

a Q a x a 2 a 3 . x 2 f{x) 

. a 0 a 1 a 2 a 3 xf(x) 

l-K | = . . a 0 a x a 2 f(x) . . (2) 

. . b 0 b x b 2 (f>{x) 

. b Q b x 6 2 6 3 x<f>{x) 

b 3 b j b 2 63 . x u (j)(^x) 

Here the sixth column can at once be reduced to 

{ . . « 3 b 3 . . } 

by subtracting x 5 col l5 and so on : hence \R\ is the dialytic 
eliminant of Sylvester. Delete row l5 row 6 , col x , col 6 from 
\R\ and the resulting four-rowed minor gives the pen- 
ultimate remainder \S\. Delete row x , row 4 , col^ col 3 of 
\S\ and the first remainder is obtained. For a full 
discussion see Muir, History of Determinants (1920), vol. 3, 
pp. 329-349, which comments on Trudi’s paper of 1862. 

Miscellaneous Examples 

1 . Between what values must the quantity m lie if the 
roots of the equation 

x 2 -{-2x(m -J-5) + 2m 2 = 0 

are to be real ? 

2 . Draw the graphs of x 2 and of 8— x 3 , and hence find 
approximately the real root of the equation x z -\-x 2 — 8=0. 

3 . Find the equation whose roots are the squares of 
those of rc 3 -f #4-1=0. 

4 . If f(x) is a polynomial of order n in x , prove f(x)f(—x) 
is one of order n in x 2 . 

6. If y—x 2 j and f(x)f(—x) = <f>(y), what are the roots of 
$(y) — 0 ? [The squares of those of x . 

% 6. Solve 27 a; 4 — 45 a; 2 -f 28 a ;— 4 — 0 , which has a repeated 

root. [~2 2 — 2d:\/^ 

l_ 3 ’ 3 ’ 3 # 
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7. Solve a^ — Qx 2 +9x = 3 by taking x = 2 + 2 cos Prove 
that one root is 4 cos 2 10° and find the other roots. 

8. The sextic equation, x*-\ -ax 3 -\-bx-\-c = 0, has three 
roots equal to a. Prove that c = —5a 8 . Find also a and b 
in terms of a, and the cubic equation which gives the other 
three roots. 

9. If a is a root of x 3 = 2, show that l+a+a 2 is a root 
of y 3 — 3y 2 — 3y — 1 = 0, and solve the latter equation. 

10. Determine the nature of the roots of 

(i) 3x 3 +x 2 —llx+6 = 0, 

(ii) 2:r 3 -f3:r=4, 

and evaluate their real roots to three significant figures. 

11. Complete the solution by Cardan’s method for the 
equation # 3 -f 12# — 112 = 0 by assuming that 7±5\/2 = 
(a±j8\/2) 3 , where a and p are integers : 

[a = p = 1, x = 4 ; etc.] 

12. Prove that there cannot be two dififering identities 
S = S = «/r(e) expressing S, a given polynomial sym- 
metric function of the roots x lf x 2 ... , x n as a polynomial 
in their elementary symmetric functions e x — Lx „ e 2 = Lx { x j9 

. . . , 6 n = X X X 2 . . . X n . 

By subtraction this would imply a nonzero polynomial F(e) 
in the e’s, which would vanish identically when expressed in 
terms of the x's. Let T — Xe 1 p e 2 ( Je 3 r ... be any nonzero term 
of F(e) ; that is A^O. Expressed in terms of x’s let each 
term T be arranged in descending lexical order w ith regard 
to x v x 2 , ... , x n . Then T w ill necessarily contain a leading 
termX = Xx x p x 2 ^x 3 R ... , where P — p+q+r- f- ... , Q = g + 

... R — r+ ... , etc. 

Since p, q , r, ... are fixed uniquely if P, Q, R, ... are 
given, no two terms T can have the same leading term. Put 
the leading terms X, one from each 7\ also into descending 
lexical order. Since they all differ they too have a single 
leading term. This term therefore stands foremost of all 
the terms in x among all the terms T. Hence it is unique 
and cannot possibly cancel out from the assumed idontity ; 
which is absurci. Thus no such identity exists, and thg 
reduction of p. 75 is unique. 
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FURTHER LIMITING AND APPROXIMATE PROCESSES 

68. Explicit Formulas for the Roots of Equations. 

Consider the polynomial F of the nth order, as given on 
p. 71, where 

F(x)=(l — cur)(l— /fo)...(l— A:r)=l — e 1 x+...+(— ) w e n £ n , (1) 
and 1 /F(x) = h 0 -[-h 1 x J r h 2 x 2 -\-... 

Multiply this last relation throughout by x n ~ x and use 
(1) p. 47. Then we find that the left-hand side expression 
yields 



l 

1 


1 1 

x n ~ x 

a 

P 


a 


a "” 2 



a n -2 fin - 2 





tt”-l P n ~ 1 


while h r is now the coefficient of x n+ r ~ x on the right. Expand 
each of the n elements in the bottom row of this numerator 
determinant in ascending powers of x (as in (2) p. 70). 
Since x v occurs in the lowest row only of this determinant 
and then along with a v etc., the coefficient of x p on the 
right has for numerator the n-rowed determinant 


1 1 

a 


a p 


== A(0 y 1, ..., n-2, p) 


( 3 ) 


say, while the denominator is a particular case of this 
with p = ft— 1. For lower positive integral and zero 
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values of p this alternant A vanishes identically, having 
two equal rows. For higher values it duly gives the h 
expressions in bialternant form 

h r = A( 0, 1, 2, n- 2, n+r-l)/il(0, 1, 2, n-1) (4) 


Example. 



1 1 1 


i i i 

For a cubic, h 6 — 

a P y 

a 7 jS 7 y 7 

4- 

<* P y 

a> p / 


, 4 ( 017 ) 


Now suppose that among the n real or complex roots 
a, j8, ... the modulus |a| of a exceeds that of each of the 
rest. Then it follows that the ratio h r+1 /h r tends to a 
itself as r tends to infinity. For since |a|>|/3|, therefore 
|/?/a| r ->0. Hence p r ]a r -+0 also, and likewise for each 
further root compared with a. On dividing each element 
of the bottom row in the numerator of both h r+l and h r 
by a r and then proceeding to the limit, the result follows. 
Thus, with n = 3, 


hr+i 

K 


J(0,l,r+3)M(012) 

A(0,l,r+2)/A(Ql2) 

_a s B(0l) 

~ a 2 B(01) ~ a ‘ 


1 

1 

1 


1 

i 

1 

a 

P 

y 

-7- 

a 

P 

y 

a 3 

0 

0 


a 2 

0 

0 


where 2?(01) denotes the two-rowed alternant of j8, y. 
Similarly for n in general. 

The same property holds in the confluent case provided 
that a is a repeated root whose modulus again exceeds 
that of each of the rest. Suitable confluent determinants 
now appear in (2) as they did on p. 48. Thus if a is re- 
peated exactly k times, the first k columns in both these 
determinants belong to a. The preceding argument for 
the limit then clears the bottom row of each determinant 
as before, except in these first k columns. Further division 
of these bottom rows by r* -1 , before taking the limit, 
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leads to zeros in all but the k th column of each determinant, 
and to the result a as before. Thus, with n = 3 and just 
two equal roots, 


1 

• 

1 


1 

i 

a 

1 

y 

-r 

a 

1 y 

a f+3 

(r-|-3)a r+2 

y r + 3 


a r+2 

(r+2)a r+1 y r+2 



1 . 1 


1 

-> 

a 1 y 

-r* 

a 1 y 


. a 2 . 


a 


This has proved the following theorem : 

Theorem. When the modulus of one root , repeated or 
unrepeated , exceeds that of each of the remaining roots , then 
the ratio h r+1 /h r tends to this root for its limit as r tends to 
infinity. 

Let us call this the regular case, and denote all others, 
where the greatest modulus belongs to two or more distinct 
roots, as irregular. Evidently these last include the case 
when the greatest modulus belongs to a complex root 
(and therefore to its differing conjugate complex root) in 
a real equation. Two theorems for the regular case can 
now be proved. The first is a simplified form of one by 
Daniel Bernoulli (1728) : the second, which marks an 
epoch in the long history of equations by giving an explicit 
formula for a root, was discovered by E. T. Whittaker in 
1918 (. Proc . Edin. Math. Soc. 1, 36 (1918) p. 103). 

Corollary. By an algebraical , followed by an arithmetical , 
division the root with greatest modulus , in the regular case , 
can be found to any degree of accuracy. 

Proof. Divide unity by F(x), using algebraical long 
division, and so obtain consecutive coefficients h ry & r+1 , ... 
in the quotient. (As in Horner’s method there is no need 
to write down the letter x , only the numerical values of 
the terms.) At any suitable stage divide h r+1 by h r , and 
the resulting quotient is an approximation to such a root, 
by the preceding theorem. 
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Examples. 

1. If F(x) = 1— 9x — 10# 2 4- ll^ 3 , the h series is 1, 9, 91, 
898, 8893, 88016, ... by long division. Also 88016-4-8893 = 
9*897..., within *001 of a. Horner’s method gives 9*898 .... 

2. Prove that the theorem holds even when any finite 
polynomial replaces unity in the numerator. What happens 
if this numerator contains a factor in common with the 
denominator ? 

Whittaker' 8 Theorem. If the equation 
0 = e 0 — e^+essX 2 — e 3 x 3 +e 4 x 4 — ... , (e 0 = 1) 

has a root a -1 which is smallest in absolute value, then 


e 2 e 3 
6 s> 


a e x e x 

p p |T 

e i e 2 

e i e 2 e 3 


e o e il 

e o e i 

Cq e x ©2 




. e 0 e a 


'provided that the series converges. 

Proof. Let this series be written in the notation 

-° + -2s- + + (8) 

e i e l e ll e il e m e lll e llll 

where the rule of suffix formation is obvious, and the 
connexion between multiple suffix and the corresponding 
determinant is defined by the relation 


&pqr 


e v e <7+l e r+ 2*** 
e v-l e Q e r+l *•• 
e j>-2 e Q- 1 e r ••• 




(9) 


The principal diagonal e V) e q , e r , ... of the determinant is 
at once fixed by the multiple suffix in e vqr . . . . Each column* 
is characterized by descending and consecutive suffixes. 
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At present we shall only need the case when all suffixes 
p, q, r are equal within a multiple suffix. As in the first 
column of e m above, negative suffixes cannot occur : 
instead, zero elements appear. 

By a theorem on dual bialternants (Aitken, Determin- 
ants and Matrices , p. 117) we have the identity 

e VQ r ... = K p ( 10 ) 

where {pqr ...} and {rr p ...} are conjugate partitions (p. 2) 
of the same positive integer, and h V p ... is constructed 
analogously to the determinant (9). Simple instances of 
this have occurred at (2) p. 71, which yield e 1 = h v e 2 = h U) 
h 2 — e lv h 3 — e in etc. The graph, as on p. 2, shows that 
e 22 — h 22 and e 222 — h 33 . On translating from e to h the 
first r terms of the series (8) now become 

K+hk + h* + ... + *izim 

h\ hfl 2 ^2^3 ^r-1 

= r + (r ~r) + (r ~r) + - + (V - H 

h x \h 2 hj \h 3 h 2 ] \ h r h r _ 1 / 

~ h r -ijh r . 


But this tends to a -1 as r tends to infinity, by the Theorem 
of p. 151, and so proves the theorem. 

Corollary. If a finite number of hj vanish but all h p+i 
(i>0) are non-zero , after a suitably chosen value p, then 


i = Vi + _ h p_p_ + 

a h p hphp+i 


. . ( 11 ) 


This follows by the same methods of proof. We may 
note that the theorem and corollary cover the regular 
^ase. For in this case the limit of h r _ x \h r exists so that 
only a finite number of terms could vanish. The modi- 
fied series (11) is automatically convergent. 
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69. The Irregular Case. In 1924 Aitken * extended 
this result to give the elementary symmetric functions of 
a subclass a -1 , j3 -1 , ... of the roots, such that each modulus 
|a|, |/?|, ... exceeds each modulus of an omitted root. 
This covers the irregular case wherein several distinct 
roots in the subclass have the same maximum modulus. 
Not merely the elementary but all the usual symmetric 
functions may be derived by such series. They depend 
on the ratio h P \liQ where P and Q are suitably chosen, 
equally numerous, multiple suffixes. 

To illustrate this let a^jS, but |a| = |j8|>|y|, where 
y denotes any other root. This case of two such equal 
moduli needs the double suffix function 


h 


rr — 


h r h r+1 
h r - 1 h r 


and the identity Jiy Tiyy r — hyy hyy hy_^ r—i r+i> 
which arises at once by applying Jacobi’s theorem of the 
adjugate (cf. Aitken, Determinants , p. 98) to the cofactors of 
the four corner-elements in the three-rowed determinant 
h rrr . 

In fact we have 


*'r + V r+1 


K + l^_K\ + _ + IK 

"'ll \™22 h n ) \h r +ly 

hi h ni ( , h r h rr , 


h r —l ^r-l \ 
hr r ) 


r+l 


= A + 


h n h 2 2 


+ ... + 


h r r h r+ 1, f+ i 


by repeated use of Jacobi’s theorem. Also by a Lemma 
which follows below, the left-hand side expression tends to 
1/a/? as r oo, while the right yields the infinite series in 
h or e at pleasure. Thus, in the present example, 

1 = e_ 0+ £ L e3 + e 1L e33 + (12) 

ap e 2 e 2 e 22 e 22 e 2 22 # 

* A. C. Aitken, Proc, Royal Soc. Edin. t 46 (1925), p. 14 ; 46 
(1926), p. 289. 
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From the limit of h r _ v r lh rr we may similarly deduce that 


= Jh ^2 ^in 

a /? hn hn h /22 


+...= - i + 


e il 6 3 


+ 


^2 ^22 ^22 ^222 


'+... ( 13 ) 


From (12) and (13) the values of a and /? can now be found. 

Lemma. The limit of h rr /h r+1> r+1 = 1/a/? as r -> oo (14) 

Proof. We use Jacobi’s fundamental identity for 
bialternants, 

Kt-r = A(P, q+1, ...,r+n—l)IA(0,l, ...,n—l) . (15) 

Cf- Aitken, Determinants , p. 116. For brevity take n — 4, 
and four distinct roots a, /?, y, S. Then 


1111 


1111 

a j8 y 8 


a /? y 8 

a r+2 pr+2 y r+2 §r+2 


a r+3 pr + 3 yr+ 3 gr+3 

a r+3 £r+3 yf+3 §r+3 


a r+4 jgr+4 ^r+4 gr+4 


Expand each determinant by a Laplace development of the 
top n— 2 rows with the bottom two rows. Then divide both 
expansions throughout by a r /? r . Since [(y or 8)~(a or /?)] r 
will occur in every term of each expansion, except the first 
term, the first alone of each will not tend to zero as r->co . 
Hence the quotient tends to 


1111 


1111 

a )3 y 8 

•1 

a j8 y S 

a 2 j8 2 . . 

* 

a 3 p 3 . . 

a 3 fi 3 . . 


a 4 p* . . 


which reduces to 1/a/?. Similarly for (13). The confluent 
case presents no difficulty if taken as in (6). 

Examples. 

* 1. Prove that a= * - ± - is. - -$2- - ... for the 

e 0 e x e 1 e ll 

greatest |a|. 
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2. Prove that a" 1 = — + H + ... . 

e il 6 li e ill e ill 6 llll 

3. If x n —e 1 x n - 1 —e 2 x n ~ 2j r , prove that 

X n+P-1 _ hpX 11 - 1 —h vl x n ~ 2 +h Pll x n ~* ~ ... . 

[Hints: 1. Use h r+1 /h r . 2. h r jh r+2 ] 

70. Other Approximate Methods for Solving 
Equations. For further details on numerical solutions 
of equations the reader should consult Whittaker and 
Itobinson’s Calculus of Observations (1926), pp. 78-132. 
But two more of the early methods may be mentioned 
here for their simplicity and interest. 

Anderson’s Method.* To solve f(x) = 0 having 
found a fair approximation x = a, put x = a(l~\-y)/(l—y) 
so that y is necessarily small. Obtain the corresponding 
equation for y but neglect third and higher powers of y> 
say p+qy+ry 2 = 0. Then y — —p/q nearly, so that 
y — — pl(q+ry) = —p/iq—pr/q) more nearly. From this 
a greatly improved value of x is obtained. The method is 
powerful and applies to equations algebraical or tran- 
scendental, and can of course be iterated. Anderson 
claimed that it solves x x = d more easily than x 3 = d. 

Example. Solvo x 3 — 2x — 5 . Here x = 2 is a good start : 
therefore put x — 2(1 -\-y)l(l — y) : whence 
1 = 432/4- 13i/ 2 + 9y 3 . 

Thus y = — to a first approximation, and next y = 1/(43 + 1 3 y) 

= 43/(43 2 + 13) = 43/1862. This gives a = 3810/1819 = 
2*094557..., which is within *00001 of the true result 
2*09455148 .... 

Maclaurin’s Method. After taking an approximate 
value a, put x = a+y, so that y is small. Proceed as 
before but retain powers of y beyond the second if neces^- 

* G. Anderson (1739), Letter to Jones, Rigaud’s Correspondence of 
Scientific Men (1841). 
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sary. Iterate. The process is more laborious than Ander- 
son’s to the same accuracy. 

Example. For x 2 — I5x 2 + 63# — 50 — 0 take x — 1+2/ 
where y is small. This gives 1 = %§y — \2y 2 -\-y z or y — 
1/(36 — 12y+y 2 ). Honce y = 1/36 nearly, and, better still, 

y = 1/(36- ^ = -02803. On putting y = • 02803 +z 

and solving for z but neglecting z 2 and higher terms, the 
result is x — T 02803923127 very nearly. 

71. Newton's Limits for the Roots. After stating 
in the Arithmetica Universalis * the rules for the sums s v of 
powers of the roots (as quoted on p. 72), Newton gave a 
series of interesting theorems including the following : 
(1) ^js r -> a the greatest root as r -> oo, (2) 5 r-1 s r+1 > s r 2 
when r is odd, (3) ^[J\/( 5 r-i 5 r+i)+l 5 r] a * Newton 
confined these to equations all whose roots were real. 
These investigations so early in the development of the 
subject are interesting, particularly when they are com- 
pared with the results given above, using the h and e 
symmetric functions. The limit of s r+1 /s r could equally 
well have been used (which would have led to a new but 
more complicated type of determinantal series on the 
basis of (7) p. 74), but, on the contrary, the r th root of 
h r would not tend to a limit. 

Newton’s rule, given above on p. 97, follows in the 
Arithmetica immediately after this discussion of s r . 

72. Newton's Rule of Signs. To discover the com- 
plex roots Newton thereupon gave a generalization of 
Descartes’ rule of signs which runs as follows : from the 
equation 

f(x) = a 0 x n +na 1 x n - 1 +n (a) a 2 x n - 2 +... +na n _ l x +a„ = 0 
£orm the real simple elements a 0 , a v a 2 , a n stripped of 

* I. Nowton, Universal Arithmetic , translated by Raphson, 
revised by Wilder, 1769. (Maclaurin’s method, p. 505.) 
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their binomial coefficients. Then form the quadratic 
elements, namely 

A 0 = a 0 2 , A x = a?— a 0 a 2 , = a 2 2 —a 1 a 3 , = a n 2 . 

Let ^ l form an associated couple and ~ — ^ l an associated 

A r J H r ^4 r+1 j 

couple of successions a r a r+1 and A r A r+1 . Let p, P, v, V 
respectively denote a permanence or variation of sign in a 
succession of small or capital letters. There are evidently 
four possibilities in an associated couple, pP, vV, pV and 
vP. 


Newton's Rule : — 

(1) The number of negative roots < ZpP. 

(2) The number of positive roots ^ ZvP. 

(3) The number of complex roots > ZV. 

This rule was first proved by Sylvester, * nearly two 
centuries later, who incidentally pointed out that (1) follows 
at once from (2) on changing x to —x in f(x), while (3) 
follows from (1) and (2) directly since ZpP -\-ZvP =* 
ZP = n— ZV. Newton adapted the rule to the case 
when some of the a { vanished. The inequalities are neces- 
sary, “ but,” says Newton, “ you may know almost by 
this rule how many roots are impossible.” 

Examples. 

1. In a; 3 -f-3:r 2 -f-12:c-f 9 = 0 the a and A series are 1, 1, 
4, 9 and 1, —3, 7, 81 respectively, with two variations V of 
sign in A. Hence at least two complex roots. 

2. In 3 8 + 6# 2 -f 9# — 16 == 0 the signs are 

a _j_ — 

A + + + + 

giving EV = 0. But there are two complex roots. 

t 

* J. J. Sylvester, Proc. London Math. Soc. t 1 (1865-1866), p. 1. 
Cf. Collected Works , 2, p. 498. 
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73. Note on the Approximate Method of Newton 

(p. 92). The success of Newton’s method for evaluating 
a real root a of the equation f(x) = 0 near a point x — a 
depends upon the shape of the curve in the neighbourhood 
of the point Z where it cuts the axis of x. We must assume 
that neither f'(x ) nor f"(x) vanishes throughout the range 
of values of x corresponding to this neighbourhood. The 
first condition ensures that f{x)jf'(x) can be evaluated at 
any point of the range, since its denominator never vanishes, 
and the second condition excludes a point of inflexion 
from the arc. This arc is therefore wholly convex upwards 
(f"{x)<0) or wholly convex downwards (f"(x)> 0) : and 
therefore must be convex towards the axis of x on one 
side of Z while being concave towards this axis on the other. 
Convexity towards the axis of x is indeed the necessary 
and sufficient condition for the method to succeed. In 
the first figure (p. 92) this convexity occurs to the left of Z 
and the successive approximations b, c, . . onwards from 
x — b at B, form a monotonically increasing sequence 
whose limit is a. In the second figure (p. 93) the convexity 
occurs to the right of Z, and the sequence a , 6, c, 
onwards from x = a at A is monotonically decreasing to 
its limit a. 

To start at a point A (p. 92) from the concave side at 
once leads to a next point B on the convex side of Z ; 
but this does not guarantee that B lies within the stipulated 
neighbourhood. It may or may not. Not only might 
BZ exceed ZA , but there might also be a point of inflexion 
on the arc between Z and Q, which would vitiate the next 
approximation. 

Analytically the above condition implies that within 
the range of convexity f(x) and f"{x) must have the 
same sign (giving four cases, two with positive f'(x ) as 
illustrated by the figures, and two with negative). The 
condition is therefore necessary. 

It is also sufficient. For, in the case when each of f(x), 
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f'(x) and f"(x) is positive, we have a>b>c>. . > a. By 
the fundamental property of convergence the sequence of 
such terms a, b, c, . . must tend to a limit l which is either 
greater than a or is a itself. This implies that the series 
U(f(x)/f'(x)), where x takes the successive values a, b, c, . 
converges to the value a— l, since the series E can also 
be written (a— b)-\-(b— c)+ ••• Now if l>a, take 
m=f(l)/f'(a) which is positive. Since f(x) exceeds f(l) 
and f'(a) exceeds f'(x) within the range, each term of the 
above series exceeds this positive constant m , and the 
series must therefore fail to converge. Hence l — a only, 
which proves the result. Similarly for the other cases. 

It follows that, however poor the original approximation 
is, the error will eventually become small. We can now 
prove that if h is such a small error at a given stage of the 
sequence, the next successive errors are of orders 
h 2 , h*, h s , . . : that is, each error is of order equal to the 
square of its predecessor. This shows a remarkably high 
rate of approximation. 

To prove this, let a = a+h, where a is now quite near 
to the root a, and h is a positive or negative error. Then 

0 =f (a-\-h) — f[a)-\-h f'(a)-\- \h 2 f"(a-\-dh) 

where 0 <6< 1. By the convexity conditions both f ff (a-\-0h) 
and f'(a) are finite and nonzero, so that 


h = 


/(«) 

/'(«) 


-w 


f’(a+6h) 

/'(a) 


/(«) 

/'(«) 


+0(A a ). 


But b = a— /(«)//'(«) ; and a —a, a— 6 , which are h y 0(h 2 ), 
are the two successive errors. The result follows. 


Examples. 

1. Obtain the Newtonian sequence for the equation x 2 = 2, 
starting with a = 1. 


3 17 577 
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2. Identify the above fractions as the first, second, fourth, 
eighth, ...» 2 n -th convergents to the recurring continued 
fraction 


^ 2 = 1+ 2T2T 


3. Investigate the sequence for the equation 

a? — 3a; 2 -f 2x = 0, starting near the root 1 with a = 1*1. 

[The convexity condition is violated. The process 
leads to a more remote root zero. Draw a figure.] 

4. Show that for the equation sin x = 0 the process breaks 
down for every value of a. 


For further information on this and still more powerful 
approximations to the root of an equation the reader should 
consult a recent paper by Dr H. W. Richmond, F.R.S., in 
the Journal London Math. Soc. 19 (1944) 31-38. It is there 
shewn that the tangents PB , QC, etc., may be replaced by 
arcs of hyperbolas or higher curves which improve the 
rapidity of approximation. Thus a hyperbola with three- 
point contact would cube the error at each stage, in contrast 
to the tangent line which squares the error. 


L 
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